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Abstract 

Bohmian mechanics is the most naively obvious embedding imaginable of Schrodinger's 
^ ' equation into a completely coherent physical theory. It describes a world in which particles 

move in a highly non-Newtonian sort of way, one which may at first appear to have little 
to do with the spectrum of predictions of quantum mechanics. It turns out, however, 
that as a consequence of the defining dynamical equations of Bohmian mechanics, when a 
system has wave function ip its configuration is typically random, with probability density 
p given by the quantum equilibrium distribution. It also turns out that the entire 
quantum formalism, operators as observables and all the rest, naturally emerges in Bohmian 
mechanics from the analysis of "measurements." This analysis reveals the status of operators 
as observables in the description of quantum phenomena, and facilitates a clear view of the 
range of applicability of the usual quantum mechanical formulas. 
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* Dedicated to Elliott Lieb on the occasion of his 70th birthday. Elliott will be (we fear unpleasantly) surprised 
to learn that he bears a greater responsibility for this paper than he could possibly imagine. We would of course 
like to think that our work addresses in some way the concern suggested by the title of his recent talks. The 
Quantum- Mechanical World View: A Remarkably Successful but Still Incomplete Theory, but we recognize that 
our understanding of incompleteness is much more naive than Elliott's. He did, however, encourage us in his 
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1 Introduction 

It is often argued that the quantum mechanical association of observables with self-adjoint oper- 
ators is a straightforward generalization of the notion of classical observable, and that quantum 
theory should be no more conceptually problematic than classical physics once this is appre- 
ciated. The classical physical observables — for a system of particles, their positions q = (q/c), 
their momenta p = (pk), and the functions thereof, i.e., functions on phase space — form a com- 
mutative algebra. It is generally taken to be the essence of quantization, the procedure which 
converts a classical theory to a quantum one, that q, p, and hence all functions f{q,p) thereof 
are replaced by appropriate operators, on a Hilbert space (of possible wave functions) associated 
with the system under consideration. Thus quantization leads to a noncommutative operator 
algebra of "observables," the standard examples of which are provided by matrices and linear 
operators. Thus it seems perfectly natural that classical observables are functions on phase space 
and quantum observables are self-adjoint operators. 

However, there is much less here than meets the eye. What should be meant by "measuring" 
a quantum observable, a self-adjoint operator? We think it is clear that this must be specified — 
without such specification it can have no meaning whatsoever. Thus we should be careful here 
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and use safer terminology by saying that in quantum theory observables are associated with self- 
adjoint operators, since it is difficult to see what could be meant by more than an association, by 
an identification of observables, regarded as somehow having independent meaning relating to ob- 
servation or measurement (if not to intrinsic "properties"), with such a mathematical abstraction 
as a self-adjoint operator. 

We are insisting on "association" rather than identification in quantum theory, but not in 
classical theory, because there we begin with a rather clear notion of observable (or property) 
which is well-captured by the notion of a function on the phase space, the state space of complete 
descriptions. If the state of the system were observed, the value of the observable would of 
course be given by this function of the state (g,p), but the observable might be observed by 
itself, yielding only a partial specification of the state. In other words, measuring a classical 
observable means determining to which level surface of the corresponding function the state of 
the system, the phase point — which is at any time definite though probably unknown — belongs. 
In the quantum realm the analogous notion could be that of function on Hilbert space, not self- 
adjoint operator. But we don't measure the wave function, so that functions on Hilbert space 
are not physically measurable, and thus do not define "observables." 

The problematical character of the way in which measurement is treated in orthodox quantum 
theory has been stressed by John Bell: 

The concept of 'measurement' becomes so fuzzy on refiection that it is quite surprising 
to have it appearing in physical theory at the most fundamental level. Less surpris- 
ing perhaps is that mathematicians, who need only simple axioms about otherwise 
undefined objects, have been able to write extensive works on quantum measurement 
theory — which experimental physicists do not find it necessary to read. . . . Does 
not any analysis of measurement require concepts more fundamental than measure- 
ment? And should not the fundamental theory be about these more fundamental 
concepts? jS] 

... in physics the only observations we must consider are position observations, if 
only the positions of instrument pointers. It is a great merit of the de Broglie-Bohm 
picture to force us to consider this fact. If you make axioms, rather than definitions 
and theorems, about the 'measurement' of anything else, then you commit redundancy 
and risk inconsistency. jH] 

The Broglie-Bohm theory, Bohmian mechanics, is a physical theory for which the concept 
of 'measurement' does not appear at the most fundamental level — in the very formulation of 
the theory. It is a theory about concepts more fundamental than 'measurement,' in terms of 
which an analysis of measurement can be performed. In a previous work [23] we have shown 
how probabilities for positions of particles given by lipl"^ emerge naturally from an analysis of 
"equilibrium" for the deterministic dynamical system defined by Bohmian mechanics, in much 
the same way that the Maxwellian velocity distribution emerges from an analysis of classical 
thermodynamic equilibrium. Our analysis entails that Born's statistical rule p = should 
be regarded as a local manifestation of a global equilibrium state of the universe, what we call 
quantum equilibrium, a concept analogous to, but quite distinct from, thermodynamic equilib- 
rium: a universe in quantum equilibrium evolves so as to yield an appearance of randomness, 
with empirical distributions in agreement with all the predictions of the quantum formalism. 

While in our earlier work we have proven, from the first principles of Bohmian mechanics, 
the "quantum equilibrium hypothesis" that when a system has wave function ip, the distribution 
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p of its configuration satisfies p = our goal here is to show that it follows from this 

hypothesis, not merely that Bohmian mechanics makes the same predictions as does orthodox 
quantum theory for the results of any experiment, but that the quantum formalism of operators 
as observables emerges naturally and simply as the very expression of the empirical import of 
Bohmian mechanics. 

More precisely, we shall show here that self-adjoint operators arise in association with specific 
experiments: insofar as the statistics for the values which result from the experiment are con- 
cerned, the notion of self-adjoint operator compactly expresses and represents the relevant data. 
It is the association ^^S' ^-^ A" between an experiment <^ and an operator A — an association that 
we shall establish in Section 2 and upon which we shall elaborate in the other sections — that is the 
central notion of this paper. According to this association the notion of operator-as-observable 
in no way implies that anything is measured in the experiment, and certainly not the operator 
itself. We shall nonetheless speak of such experiments as measurements, since this terminology 
is unfortunately standard. When we wish to emphasize that we really mean measurement — the 
ascertaining of the value of a quantity — we shall often speak of genuine measurement. 

Much of our analysis of the emergence and role of operators as observables in Bohmian 
mechanics, including the von Neumann-type picture of measurements at which we shall arrive, 
applies as well to orthodox quantum theory. Indeed, the best way to understand the status 
of the quantum formalism — and to better appreciate the minimality of Bohmian mechanics — is 
Bohr's way: What are called quantum observables obtain meaning only through their association 
with specific experiments. We believe that Bohr's point has not been taken to heart by most 
physicists, even those who regard themselves as advocates of the Copenhagen interpretation. 

Indeed, it would appear that the argument provided by our analysis against taking operators 
too seriously as observables has even greater force from an orthodox perspective: Given the 
initial wave function, at least in Bohmian mechanics the outcome of the particular experiment 
is determined by the initial configuration of system and apparatus, while for orthodox quantum 
theory there is nothing in the initial state which completely determines the outcome. Indeed, we 
find it rather surprising that most proponents of standard quantum measurement theory, that is 
the von Neumann analysis of measurement [73], beginning with von Neumann, nonetheless seem 
to retain an uncritical identification of operators with properties. Of course, this is presumably 
because more urgent matters — the measurement problem and the suggestion of inconsistency and 
incoherence that it entails — soon force themselves upon one's attention. Moreover such difficulties 
perhaps make it difficult to maintain much confidence about just what should be concluded from 
the "measurement" analysis, while in Bohmian mechanics, for which no such difficulties arise, 
what should be concluded is rather obvious. 

Moreover, a great many significant real- world experiments are simply not at all associated with 
operators in the usual way. Because of these and other difficulties, it has been proposed that we 
should go beyond operators-as-observables, to generalized observables, described by mathematical 
objects (positive-operator- valued measures, POVMs) even more abstract than operators (see, e.g., 
the books of Davies [21], Holevo 50^ and Kraus [SH])- It may seem that we would regard this 
development as a step in the wrong direction, since it supplies us with a new, much larger class 
of abstract mathematical entities about which to be naive realists. We shall, however, show that 
these generalized observables for Bohmian mechanics form an extremely natural class of objects 
to associate with experiments, and that the emergence and role these observables is merely an 
expression of quantum equilibrium together with the linearity of Schrodinger's evolution. It is 
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therefore rather dubious that the occurrence of generahzed observables — the simplest case of 
which are self-adjoint operators — can be regarded as suggesting any deep truths about reality or 
about epistemology. 

As a byproduct of our analysis of measurement we shall obtain a criterion of measurability and 
use it to examine the genuine measurability of some of the properties of a physical system. In this 
regard, it should be stressed that measurability is theory-dependent: different theories, though 
empirically equivalent, may differ on what should be regarded as genuinely measurable within 
each theory. This important — though very often ignored — point was made long ago by Einstein 
and has been repeatedly stressed by Bell. It is best summarized by Einstein's remark "It is 
the theory which decides what we can observe. " 

We note in passing that measurability and reality are different issues. Indeed, for Bohmian 
mechanics most of what is "measurable" (in a sense that we will explain) is not real and most of 
what is real is not genuinely measurable. (The main exception, the position of a particle, which 
is both real and genuinely measurable, is, however, constrained by absolute uncertainty |2S])- 

In focusing here on the role of operators as observables, we don't wish to suggest that there 
are no other important roles played by operators in quantum theory. In particular, in addition to 
the familiar role played by operators as generators of symmetries and time-evolutions, we would 
like to mention the rather different role played by the field operators of quantum field theory: to 
link abstract Hilbert-space to space-time and structures therein, facilitating the formulation of 
theories describing the behavior of an indefinite number of particles 1^ . 

Finally, we should mention what should be the most interesting sense of measurement for a 
physicist, namely the determination of the coupling constants and other parameters that define 
our physical theories. This has little to do with operators as observables in quantum theory and 
shall not be addressed here. 

Notations and Conventions 

Q = (Qiy ■ ■ ■ :Qn) denotes the actual configuration of a system of N particle with positions 
Qfc! Q = (qi5 • • • 5 Qtv) is its generic configuration. Whenever we deal with a system-apparatus 
composite, x {X) will denote the generic (actual) configuration of the system and y (Y) that 
of the apparatus. Sometimes we shall refer to the system as the x-system and the apparatus 
as the y-sjstem. Since the apparatus should be understood as including all systems relevant to 
the behavior of the system in which we are interested, this notation and terminology is quite 
compatible with that of Section 12.21 in which y refers to the environment of the x-system. 

For a system in the state will denote the quantum equilibrium measure, p^{dq) = 

|*(g)|2dg. If Z = F{Q) then p| denotes the measure induced by F, i.e. Pq, = o F ^. 



2 Bohmian Experiments 

According to Bohmian mechanics, the complete description or state of an A^-particle system 
is provided by its wave function \l/(g, t), where q = (qi, . . . , qA?) G M^^, and its configuration 
Q = (Qi, . . . , Qat) G M'^^, where the are the positions of the particles. The wave function, 
which evolves according to Schrodinger's equation, 

^h— = H^, (2.1) 
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choreographs the motion of the particles: these evolve according to the equation 



— — = — Im Qi,...,Qjv 2.2 

at rrik WW 

where Vfc = d/dqi:. In equation ()2.1|1 . H is the usual nonrelativistic Schrodinger Hamiltonian; 
for spinless particles it is of the form 

H = -y^ + V, (2.3) 

containing as parameters the masses mi . . . ,mN of the particles as well as the potential energy 
function V of the system. For an iV-particle system of nonrelativistic particles, equations ()2.H1 
and ()2.2|) form a complete specification of the theory (magnetic fields^ and spin,^ as well as Fermi 
and Bose-Einstein statistics,^ can easily be dealt with and in fact arise in a natural manner 
jHl El ISH 12H1)- There is no need, and indeed no room, for any further axioms, describing 
either the behavior of other observables or the effects of measurement. 



2.1 Equivariance and Quantum Equilibrium 

It is important to bear in mind that regardless of which observable one chooses to measure, the 
result of the measurement can be assumed to be given configurationally, say by some pointer 
orientation or by a pattern of ink marks on a piece of paper. Then the fact that Bohmian 
mechanics makes the same predictions as does orthodox quantum theory for the results of any 
experiment — for example, a measurement of momentum or of a spin component — provided we 
assume a random distribution for the configuration of the system and apparatus at the beginning 
of the experiment given by |\l/(g)p — is a more or less immediate consequence of ()2.2|) . This is 
because of the quantum continuity equation 

^ + divJ'^ = 0, 
ot 

which is a simple consequence of Schrodinger's equation. Here J* = (Jf , . . . , J*) with 

J* = — Im(W*VfcW) 

rrik 

the quantum probability current. This equation becomes the classical continuity equation 

^ + diYpv = (2.4) 

for the system of equations dQ/dt = v defined by (j2.2j) — governing the evolution of the probability 
density p under the motion defined by the guiding equation ()2.2|) for the particular choice p = 

^When a magnetic field is present, the gradients in the equations (|2.1I and 1)2. 2|l must be understood as 
the covariant derivatives involving the vector potential A. 
2 See Section O 

^For indistinguishable particles, a careful analysis [2H| of the natural configuration space, which is no longer 
R"^^, leads to the consideration of wave functions on M.^^ that are either symmetric or antisymmetric under 
permutations. 
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|v[>|2 _ v]/*vi/. In other words, if the probabihty density for the configuration satisfies p{q,tQ) = 
\'^{Q:to)\'^ some time to, then the density to which this is carried by the motion (j2.2|) at any 
time t is also given by p{q,t) = This is an extremely important property of any 

Bohmian system, as it expresses a certain compatibility between the two equations of motion 
defining the dynamics, which we call the equivariance^ of I^E'p. 

The above assumption guaranteeing agreement between Bohmian mechanics and quantum 
mechanics regarding the results of any experiment is what we call the "quantum equilibrium 
hypothesis" : 

When a system has wave function its configuration Q is random with probability . , 
distribution given by the measure pq,(dq) = |\E'(g)p(ig. ^ ' ^ 

When this condition is satisfied we shall say that the system is in quantum equilibrium and we 
shall call p^. the quantum equilibrium distribution. While the meaning and justification of ()2.5|) 
is a delicate matter, which we have discussed at length elsewhere [25J, it is important to recognize 
that, merely as a consequence of ()2.2j) and ()2.5j) . Bohmian mechanics is a counterexample to all 
of the claims to the effect that a deterministic theory cannot account for quantum randomness 
in the familiar statistical mechanical way, as arising from averaging over ignorance: Bohmian 
mechanics is clearly a deterministic theory, and, as we have just explained, it does account for 
quantum randomness as arising from averaging over ignorance given by [^'(g)^. 



2.2 Conditional and Effective Wave Functions 

Which systems should be governed by Bohmian mechanics? An n-particle subsystem of an A^- 
particle system (n < N) need not in general be governed by Bohmian mechanics, since no wave 
function for the subsystem need exist. This will be so even with trivial interaction potential V, 
if the wave function of the system does not properly factorize; for nontrivial V the Schrodinger 
evolution would in any case quickly destroy such a factorization. Therefore in a universe governed 
by Bohmian mechanics there is a priori only one wave function, namely that of the universe, and 
there is a priori only one system governed by Bohmian mechanics, namely the universe itself. 

Consider then an iV-particle non relativistic universe governed by Bohmian mechanics, with 
(universal) wave function \E'. Focus on a subsystem with configuration variables x, i.e., on a 
splitting q = {x, y) where y represents the configuration of the environment of the x-system. The 
actual particle configurations at time t are accordingly denoted by Xt and Yt, i.e., Qt = {Xt, Yt). 
Note that = "^ti^jy)- How can one assign a wave function to the x-system? One obvious 
possibility — afforded by the existence of the actual configuration — is given by what we call the 

Equivariance can be formulated in very general terms: consider the transformations L/ : ^' — s- and 
f '■ Q ~^ f{Q)i where U is & unitary transformation on LF'{dq) and / is a transformation on configuration space 
that may depend on 5*. We say that the map 5* s- /i^ from wave functions to measures on configuration space is 
equivariant with respect to U and / if nu^ — ° /^^- The above argument based on the continuity equation 
H2.4|l shows that 5* j^E'pcig is equivariant with respect to U = Ut = e~*R^, where H is the Schrodinger 
Hamiltonian (|2.3I) and f = ft is the solution map of 12.211 . In this regard, it is important to observe that for a 
Hamiltonian H which is not of Schrodinger type we shouldn't expect (|2.2|l to be the appropriate velocity field, that 
is, a field which generates an evolution in configuration space having j^Pp as equivariant density. For example, 
for H — Cj-^, where c is a constant (for simplicity we are assuming configuration space to be one-dimensional), 
we have that j^'p is equivariant provided the evolution of configurations is given by dQ/dt = c. In other words, 
for Ut = e'^*^ the map ^ i-^ l^'pdq is equivariant if /t : Q ^ Q + ct. 
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conditional wave function of the x-system 



M^) = ^t{x,Yt). (2.6) 

To get familiar with this notion consider a very simple one dimensional universe made of two 
particles with Hamiltonian {h = 1) 

H = Hi" + Hi"' + Hi"" = + ^) + - yf. 

and initial wave function 

1 x"^ 1 y'^ 

\]/g = -0 (g) $0 with ijj{x) = TT~^e~~ and $o(y) = ^r^^e"^. 
Then (pTT|) and are easily solved: 

y) = TT^^ (1 + it)"^e"^ [(""-y^^+^JT^] , 

= a(t)X + b{t)Y and Ft = b{t)X + a{t)Y, 

where a{t) = + + 1], b{t) = |[(1 + — 1], and X and Y are the initial positions of the 
two particles. Focus now on one of the two particles (the x-system) and regard the other one as 
its environment (the y-system). The conditional wave function of the x-system 

depends, through Yt, on both the initial condition Y for the environment and the initial condition 
X for the particle. As these are random, so is the evolution of ipt, with probability law determined 
by |\l/oP- In particular, ipt does not satisfy Schrodinger's equation for any 

We remark that even when the x-system is dynamically decoupled from its environment, its 
conditional wave function will not in general evolve according to Schrodinger's equation. Thus 
the conditional wave function lacks the dynamical implications from which the wave function of 
a system derives much of its physical significance. These are, however, captured by the notion of 
effective wave function: 

Suppose that \I^(a;,?/) = ilj{x)^{y) + '^^{x,y) , where $ and \E'"'" have macroscop- 

ically disjoint y -supports. If Y E supp $ we say that tp is the effective wave (2.7) 

function of the x-system. 

Of course, ip is also the conditional wave function since nonvanishing scalar multiples of wave 
functions are naturally identified.^ 

^Note that in Bohmian mechanics the wave function is naturally a projective object since wave functions 
differing by a multiplicative constant — possibly time-dependent — are associated with the same vector field, and 
thus generate the same dynamics. 
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2.3 Decoherence 



One might wonder why systems possess an effective wave function at alL In fact, in general they 
don't! For example the x-system will not have an effective wave function when, for instance, 
it belongs to a larger microscopic system whose effective wave function doesn't factorize in the 
appropriate way. However, the larger the environment of the x-system, the greater is the potential 
for the existence of an effective wave function for this system, owing in effect to the abundance 
of "measurement-like" interaction with a larger environment.^ 

We remark that it is the relative stability of the macroscopic disjointness employed in the 
definition of the effective wave function, arising from what are nowadays often called mechanisms 
of decoherence — the destruction of the coherent spreading of the wave function, the effectively 
irreversible flow of "phase information" into the (macroscopic) environment — which accounts 
for the fact that the effective wave function of a system obeys Schrodinger's equation for the 
system alone whenever this system is isolated. One of the best descriptions of the mechanisms of 
decoherence, though not the word itself, can be found in Bohm's 1952 "hidden variables" paper 



Decoherence plays a crucial role in the very formulation of the various interpretations of 
quantum theory loosely called decoherence theories (Griffiths jlH], Omnes jHS], Leggett jHOl, Zurek 
j79j . Joos and Zeh Gell-Mann and Hartle jSS])- In this regard we wish to emphasize, however, 
as did Bell in his article "Against Measurement" JT], that decoherence in no way comes to grips 
with the measurement problem itself, being arguably a necessary, but certainly not a sufficient, 
condition for its complete resolution. In contrast, for Bohmian mechanics decoherence is purely 
phenomenological — it plays no role whatsoever in the formulation (or interpretation) of the theory 
itself^ — and the very notion of effective wave function accounts at once for the reduction of the 
wave packet in quantum measurement. 

According to orthodox quantum measurement theory [731 ^3 [THl EZl , after a measurement, 
or preparation, has been performed on a quantum system, the x-system, the wave function for 
the composite formed by system and apparatus is of the form 



with the different $0 supported by the macroscopically distinct (sets of) configurations corre- 
sponding to the various possible outcomes of the measurement, e.g., given by apparatus pointer 
orientations. Of course, for Bohmian mechanics the terms of ()2.8p are not all on the same 
footing: one of them, and only one, is selected, or more precisely supported, by the outcome — 
corresponding, say, to ao — which actually occurs. To emphasize this we may write ()2.8|) in the 

^To understand how this comes about one may suppose that initially the y-supports of <& and ^E"^ (of. the 
definition above of effective wave function) are just "sufficiently" (but not macroscopically) disjoint. Then, due 
to the interaction with the environment, the amount of y-disjointness will tend to increase dramatically as time 
goes on, with, as in a chain reaction, more and more degrees of freedom participating in this disjointness. When 
the effect of this "decoherence" is taken into account, one finds that even a small amount of y-disjointness will 
often tend to become "sufficient," and quickly "more than sufficient," and finally macroscopic. 

''However, decoherence plays an important role in the emergence of Newtonian mechanics as the description of 
the macroscopic regime for Bohmian mechanics, supporting a picture of a macroscopic Bohmian particle, in the 
classical regime, guided by a macroscopically well-localized wave packet with a macroscopically sharp momentum 
moving along a classical trajectory. It may, indeed, seem somewhat ironic that the gross features of our world 
should appear classical because of interaction with the environment and the resulting wave function entanglement, 
the characteristic quantum innovation. 
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form 

(g) $ + \1>-^ 

where = ifjao, $ = $ao; — Sq^oq ® comparison with ()2.7j) it follows that 

after the measurement the x-system has effective wave function ip^Q- This is how collapse (or 
reduction) of the effective wave function to the one associated with the outcome arises in 
Bohmian mechanics. 

While in orthodox quantum theory the "collapse" is merely superimposed upon the unitary 
evolution — without a precise specification of the circumstances under which it may legitimately 
be invoked — we have now, in Bohmian mechanics, that the evolution of the effective wave function 
is actually given by a stochastic process, which consistently embodies both unitarity and collapse 
as appropriate. In particular, the effective wave function of a subsystem evolves according to 
Schrodinger's equation when this system is suitably isolated. Otherwise it "pops in and out" 
of existence in a random fashion, in a way determined by the continuous (but still random) 
evolution of the conditional wave function ipf Moreover, it is the critical dependence on the 
state of the environment and the initial conditions which is responsible for the random behavior 
of the (conditional or effective) wave function of the system. 

2.4 Wave Function and State 

As an important consequence of ()2.6|) we have, for the conditional probability distribution of 
the configuration Xt of a system at time t, given the configuration of its environment, the 
fundamental conditional probability formula 25 : 

Probvt,, {Xt e dx I Yt) = dx, (2.9) 

where 

Prob*o(rfg) = \^oiQ)\^dQ, 

with Q = {X, Y) the configuration of the universe at the (initial) time t = 0. Formula ()2.9|1 is 
the cornerstone of our analysis [23] on the origin of randomness in Bohmian mechanics. Since 
the right hand side of ()2.9|) involves only the effective wave function, it follows that the wave 
function ipt of a subsystem represents maximal information about its configuration Xt. In other 
words, given the fact that its wave function is ipti it is in principle impossible to know more about 
the configuration of a system than what is expressed by the right hand side of (j2.9|) . even when 
the detailed configuration Yt of its environment is taken into account P3] 

Prob*o(Xt G dx \ Yt) = Prob^,, (Xt G dx \ ipt) = |V't(x)|^ rfx. (2.10) 

The fact that the knowledge of the configuration of a system must be mediated by its wave 
function may partially account for the possibility of identifying the state of a system — its com- 
plete description — with its wave function without encountering any practical difficulties. This is 
primarily because of the wave function's statistical role, but its dynamical role is also relevant 
here. Thus it is natural, even in Bohmian mechanics, to regard the wave function as the "state" 
of the system. This attitude is supported by the asymmetric roles of configuration and wave 
function: while the fact that the wave function is ip entails that the configuration is distributed 
according to the fact that the configuration is X has no implications whatsoever for the 
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wave function.^ Indeed, such an asymmetry is grounded in the dynamical laws and in the initial 
conditions, ip is always assumed to be fixed, being usually under experimental control, while X 
is always taken as random, according to the quantum equilibrium distribution. 

When all is said and done, it is important to bear in mind that regarding ip as the "state" 
is only of practical value, and shouldn't obscure the more important fact that the most de- 
tailed description — the complete state description — is given (in Bohmian mechanics) by the wave 
function and the configuration. 

2.5 The Stern-Gerlach Experiment 

Information about a system does not spontaneously pop into our heads or into our (other) 
"measuring" instruments; rather, it is generated by an experiment: some physical interaction 
between the system of interest and these instruments, which together (if there is more than one) 
comprise the apparatus for the experiment. Moreover, this interaction is defined by, and must 
be analyzed in terms of, the physical theory governing the behavior of the composite formed 
by system and apparatus. If the apparatus is well designed, the experiment should somehow 
convey significant information about the system. However, we cannot hope to understand the 
significance of this "information" — for example, the nature of what it is, if anything, that has 
been measured — without some such theoretical analysis. 

As an illustration of such an analysis we shall discuss the Stern-Gerlach experiment from 
the standpoint of Bohmian mechanics. But first we must explain how spin is incorporated into 
Bohmian mechanics: If is spinor-valued, the bilinear forms appearing in the numerator and 
denominator of (j2.2|) should be understood as spinor- inner-pro ducts; e.g., for a single spin ^ 
particle the two-component wave function 



where ( ■ , ■ ) denotes the scalar product in the spin space C^. The wave function evolves via 
(j2.ip . where now the Hamiltonian H contains the Pauli term, for a single particle proportional 
to B ■ cr, that represents the coupling between the "spin" and an external magnetic field B; here 
cr = (cr^, ay,az) are the Pauli spin matrices which can be taken to be 



Let's now focus on a Stern-Gerlach "measurement of the operator az": An inhomogeneous 
magnetic field B is established in a neighborhood of the origin, by means of a suitable arrangement 
of magnets. This magnetic field is oriented in the positive z-direction, and is increasing in this 
direction. We also assume that the arrangement is invariant under translations in the x-direction, 
i.e., that the geometry does not depend upon x-coordinate. A particle with a fairly definite 

*The "fact" (that the configuration is X) shouldn't be confused with the "knowledge of the fact" : the latter 
does have such implications |25| ! 




generates the velocity 
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momentum is directed towards the origin along the negative y-axis. For simphcity, we shall 
consider a neutral spin-1/2 particle whose wave function \1/ evolves according to the Hamiltonian 



H = V^-uCT-B. (2.12) 

2m ^ ^ ^ 

where /x is a positive constant (if one wishes, one might think of a fictitious electron not feeling 
the Lorentz force). 

The inhomogeneous field generates a vertical deflection of away from the y-axis, which for 
Bohmian mechanics leads to a similar deflection of the particle trajectory according to the velocity 
field defined by (12.111) : if its wave function \& were initially an eigenstate of of eigenvalue 1 or 
— 1, i.e., if it were of the form 

^(+) ^ ^(+) (g, $^(^x) or = ijj^'^ (g) $o(x) 

where 

V^W^(J) and ^("^^(J) (2.13) 

then the deflection would be in the positive (negative) z-direction (by a rather definite angle). 
This limiting behavior is readily seen for $o = ^o{z)(f){x,y) and B = (0,0, -B), so that the z- 
motion is completely decoupled from the motion along the other two directions, and by making 
the standard (albeit unphysical) assumption [iH], [Ej 

dB 

— = const >0. (2.14) 
oz 

whence 

/icr-B = (6 + az)(7z 
where a > and b are constants. Then 

^i-^^(^i^\%M^^y)) and ^.1-)^ ° ^ 

V ; V '{z)(f)t{x,y) 

where ^j^'' are the solutions of 

for initial conditions $0^^"* = ^o{z)- Since z generates translations of the 2;-component of the 
momentum, the behavior described above follows easily. More explicitly, the limiting behavior 
for t — >■ cx) readily follows by a stationary phase argument on the explicit solution^ of ()2.15|) . 

^Eq. (|2.15|) is readily solved: 
where (by the standard rules for the Green's function of linear and quadratic Hamiltonians) 
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More simply, we may consider the initial Gaussian state 



(2rf27r)3 

for which (2;)^, the probability density of the particle being at a point of 2;-coordinate is, 
by the linearity of the interaction in ()2.15p . a Gaussian with mean and mean square deviation 
given respectively by 

For a more general initial wave function, 

\i/ = iP ^% 

passage through the magnetic field will, by linearity, spht the wave function into an upward- 
deflected piece (proportional to ip^~^^) and a downward-deflected piece (proportional to ip^'^), 
with corresponding deflections of the trajectories. The outcome is registered by detectors placed 
in the paths of these two possible "beams." Thus of the four kinematically possible outcomes 
("pointer orientations") the occurrence of no detection and of simultaneous detection can be 
ignored as highly unlikely, and the two relevant outcomes correspond to registration by either 
the upper or the lower detector. Accordingly, for a measurement of the experiment is equipped 
with a "calibration" (i.e., an assignment of numerical values to the outcomes of the experiment) 
A+ = 1 for upper detection and A_ = —1 for lower detection (while for a measurement of the 
2-component of the spin angular momentum itself the calibration is given by ^hX±). 

Note that one can completely understand what's going on in this Stern-Gerlach experiment 
without invoking any putative property of the electron such as its actual 2;-component of spin 
that is supposed to be revealed in the experiment. For a general initial wave function there is 
no such property. What is more, the transparency of the analysis of this experiment makes it 
clear that there is nothing the least bit remarkable (or for that matter "nonclassical" ) about 
the nonexistence of this property. But the failure to pay attention to the role of operators 
as observables, i.e., to precisely what we should mean when we speak of measuring operator- 
observables, helps create a false impression of quantum peculiarity. 

2.6 A Remark on the Reality of Spin in Bohmian Mechanics 

Bell has said that (for Bohmian mechanics) spin is not real. Perhaps he should better have 
said: "Even spin is not real," not merely because of all observables, it is spin which is generally 
regarded as quantum mechanically most paradigmatic, but also because spin is treated in ortho- 
dox quantum theory very much like position, as a "degree of freedom" — a discrete index which 
supplements the continuous degrees of freedom corresponding to position — in the wave function. 

Be that as it may, his basic meaning is, we believe, this: Unlike position, spin is not primitive, 
i.e., no actual discrete degrees of freedom, analogous to the actual positions of the particles, are 
added to the state description in order to deal with "particles with spin." Roughly speaking, spin 
is merely in the wave function. At the same time, as explained in Section 12.51 "spin measure- 
ments" are completely clear, and merely reflect the way spinor wave functions are incorporated 
into a description of the motion of conflgurations. 



t/j = aij^+^ + (3^)^-'^ (2.17) 
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In this regard, it might be objected that while spin may not be primitive, so that the result of 
our "spin measurement" will not reflect any initial primitive property of the system, nonetheless 
this result is determined by the initial configuration of the system, i.e., by the position of our 
electron, together with its initial wave function, and as such — as a function Xa-^(q, tp) of the state 
of the system — it is some property of the system and in particular it is surely real. We shall 
address this issue in Sections 18.31 and 18.41 

2.7 The Framework of Discrete Experiments 

We shall now consider a generic experiment. Whatever its significance, the information conveyed 
by the experiment is registered in the apparatus as an output, represented, say, by the orientation 
of a pointer. Moreover, when we speak of a generic experiment, we have in mind a fairly definite 
initial state of the apparatus, the ready state $o = ^oiu), one for which the apparatus should 
function as intended, and in particular one in which the pointer has some "null" orientation, as 
well as a definite initial state of the system = ip^x) on which the experiment is performed. 
Under these conditions it turns out [23] that the initial t = wave function \I^o = ^o(q') of the 
composite system formed by system and apparatus, with generic configuration q = {x,y), has a 
product form, i.e.. 

Such a product form is an expression of the independence of system and apparatus immediately 
before the experiment begins.^" 

For Bohmian mechanics we should expect in general, as a consequence of the quantum equi- 
librium hypothesis, that the outcome of the experiment — the final pointer orientation — will be 
random: Even if the system and apparatus initially have definite, known wave functions, so that 
the outcome is determined by the initial configuration of system and apparatus, this configu- 
ration is random, since the composite system is in quantum equilibrium, and the distribution 
of the final configuration is given by \'$T{x,y)\'^, where \1/t is the wave function of the system- 
apparatus composite at the time t = T when the experiment ends, and x, respectively y, is the 
generic system, respectively apparatus, configuration. 

Suppose now that \1/t has the form ()2.8|) . which roughly corresponds to assuming that the 
experiment admits, i.e., that the apparatus is so designed that there is, only a finite (or countable) 
set of possible outcomes, given, say, by the different possible macroscopically distinct pointer 
orientations of the apparatus and corresponding to a partition of the apparatus configuration 
space into macroscopically disjoint regions Ga, ol = 1,2,....^^ We arrive in this way at the 
notion of discrete experiment, for which the time evolution arising from the interaction of the 
system and apparatus from t = to t = T is given by the unitary map 

a a 

where Ti. is the system Hilbert space of square-integrable wave functions with the usual inner 

-'^"It might be argued that it is somewhat unrealistic to assume a sharp preparation of ip, as well as the possibility 
of resetting the apparatus always in the same initial state $o- We shall address this issue in Section 6 

^^Note that to assume there are only finitely, or countably, many outcomes is really no assumption at all, since 
the outcome should ultimately be converted to digital form, whatever its initial representation may be. 
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product 

{ip,(f)) = I il>*{x) (j){x) dx. 



and the are a fixed set of (normalized) apparatus states supported by the macroscopically 
distinct regions Ga of apparatus configurations. 

The experiment usually comes equipped with an assignment of numerical values Aq, (or a 
vector of such values) to the various outcomes a. This assignment is defined by a "calibration" 
function F on the apparatus configuration space assuming on each region Ga the constant value 
Aq. If for simplicity we assume that these values are in one-to-one correspondence with the 
outcomes^^ then 

Pa = I \^T{,x,y)\^dxdy= I \^T{x,y)\^dx dy (2.19) 

is the probability of finding A^, for initial system wave function ip. Since ^a'iv) = for y E Ga 
unless a = a', we obtain 

Pa= I dx I \Y,i'c.'{x)^a'{y)?dy= I \i,a{x)\''dx=\\i,a\?. (2.20) 

Note that when the result A^ is obtained, the effective wave function of the system undergoes 
the transformation ip ^ tpa- 

A simple example of a discrete experiment is provided by the map 

U : i;0%^^Caip0^a, (2.21) 

a 

where the Cq are complex numbers such that J2a I'^aP ~ then pa = |cap. Note that the 
experiment defined by 1)2.211) resembles a coin-flip more than a measurement since the outcome 
a occurs with a probability independent of ip. 



2.8 Reproducibility and its Consequences 

Though for a generic discrete experiment there is no reason to expect the sort of "measurement- 
like" behavior typical of familiar quantum measurements, there are, however, special experiments 
whose outcomes are somewhat less random than we might have thought possible. According to 
Schrodinger [72]: 

The systematically arranged interaction of two systems (measuring object and mea- 
suring instrument) is called a measurement on the first system, if a directly-sensible 
variable feature of the second (pointer position) is always reproduced within certain 
error limits when the process is immediately repeated (on the same object, which in 
the mean time must not be exposed to additional influences). 

To implement the notion of "measurement-like" experiment considered by Schrodinger, we 
first make some preliminary observations concerning the unitary map ()2.18)1 . Let P[<i>a] be the 

^■^We shall consider the more general case later on in Subsection 13.2.41 
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orthogonal projection in the Hilbert space 0^ 7i $q onto the subspace ?i $q and let Tia be 
the subspaces of 7i defined by 

P[$^] [U{H $0)] = 7^ ® $a . (2.22) 

(Since the vectors in Ha arise from projecting \1't = J2a'^<^ ® onto its a-component, Ha is 
the space of the "collapsed" wave functions associated with the occurrence of the outcome a.) 
Then _ 

UiH 0^0) C^Ha® (2.23) 

a 

Note, however, that it need not be the case that U(H (S> $0) = ©a'^a ® ^a, and that the 
spaces Ha need be neither orthogonal nor distinct; e.g., for 1)2.211) Ha = H and U{H® $0) = 

A "measurement-like" experiment is one which is reproducible in the sense that it will yield 
the same outcome as originally obtained if it is immediately repeated. (This means in particular 
that the apparatus must be immediately reset to its ready state, or a fresh apparatus must 
be employed, while the system is not tampered with so that its initial state for the repeated 
experiment is its final state produced by the first experiment.) Thus the experiment is reproducible 
if _ _ 

U{Ha ® $0) ^Ha®^a (2.24) 

or, equivalently, if there are spaces Hj C Ha such that 

U{Ha ® $o) = HJ ® . (2.25) 

Note that it follows from the unitarity of Ujmd the orthogonality of the subspaces Ha ® ^a 
that the subspaces Ha ® '^'o and hence the Ha are also orthogonal. Therefore, by taking the 
orthogonal sum over a of both sides of ()2.25p . we obtain 

@U{Ha®^,) = u(@Ha®%\ =0^„'®<f.. (2.26) 

a \ a J a 

If we now make the simplifying assumption that the subspaces Ha are finite dimensional, we have 
from unitarity that Ha = Hj , and thus, by comparing ()2.23|) and ()2.26|) . that equahty holds in 
and that 

H = ^Ha (2.27) 

a 

with 

U{Ha ® $o) =Ha®^a (2.28) 

for 

nj — n_j n_j I 

I \^ a — ' ^a ' a • 



^^Note that if Ji has finite dimension n, and the number of outcomes a. is to, dim [C/(7i ® <i>o)] = while 
dim [0^ Ji ® $a] = n-m. 
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Therefore if the wave function of the system is initially in Ti^, outcome a definitely occurs and 
the value Aq, is thus definitely obtained (assuming again for simplicity one-to-one correspondence 
between outcomes and results). It then follows that for a general initial system wave function 

a 

where P-j-c^ is the projection in TC onto the subspace TCa, that the outcome a, with result Aq,, is 
obtained with (the usual) probability 

Pa=\\PnM' = {^,Pn.,^), (2.29) 

which follows from (j^UHl), and (ETTHIl since U {Pn^ip^^o) = -ipa^^a and hence = 

\\ipa\\ by unitarity. In particular, when the A^ are real-valued, the expected value obtained is 

J^P.Aq = J2\a\\PHM' = (^,^^) (2.30) 

a a 

where 

A = J2>^aPn. (2.31) 

a 

is the self-adjoint operator with eigenvalues Aq and spectral projections Pua- 



2.9 Operators as Observables 

What we wish to emphasize here is that, insofar as the statistics for the values which result from 
the experiment are concerned, 

the relevant data for the experiment are the collection {Ha} of special orthogonal 
subspaces, together with the corresponding calibration {Aq,}, 

and this data is compactly expressed and represented by the self-adjoint operator A, on the system 
Hilbert space 7i, given by \2.31]) . Thus, under the assumptions we have made, with a reproducible 
experiment S we naturally associate an operator A = A^, a. single mathematical object, defined 
on the system alone, in terms of which an efficient description (|2.29p of the statistics of the 
possible results is achieved; we shall denote this association by 

(^^A. (2.33) 

If we wish we may speak of "operators as observables," and when an experiment S' is associated 
with a self-adjoint operator A, as described above, we may say that the experiment S' is a 
"measurement" of the observable represented by the self-adjoint operator A. If we do so, however, 
it is important that we appreciate that in so speaking we merely refer to what we have just 
derived: the role of operators in the description of certain experiments.^^ 

So understood, the notion of operator-as-observable in no way implies that anything is gen- 
uinely measured in the experiment, and certainly not the operator itself! In a general experiment 

"'^^Operators as observables also naturally convey information about the system's wave function after the exper- 
iment. For example, for an ideal measurement, when the outcome is a the wave function of the system after the 
experiment is (proportional to) P-j^ ip. We shall elaborate upon this in the next section. 
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no system property is being measured, even if the experiment happens to be measurement-hke. 
(Position measurements in Bohmian mechanics are of course an important exception.) What in 
general is going on in obtaining outcome a is completely straightforward and in no way suggests, 
or assigns any substantive meaning to, statements to the effect that, prior to the experiment, 
observable A somehow had a value — whether this be in some determinate sense or in the 
sense of Heisenberg's "potentiality" or some other ill-defined fuzzy sense — which is revealed, or 
crystallized, by the experiment. Even speaking of the observable A as having value Aq, when 
the system's wave function is in TYq, i.e., when this wave function is an eigenstate of A of eigen- 
value Aq, — insofar as it suggests that something peculiarly quantum is going on when the wave 
function is not an eigenstate whereas in fact there is nothing the least bit peculiar about the 
situation — perhaps does more harm than good. 

It might be objected that we are claiming to arrive at the quantum formalism under some- 
what unrealistic assumptions, such as, for example, reproducibility or finite dimensionality. We 
agree. But this objection misses the point of the exercise. The quantum formalism itself is an 
idealization; when applicable at all, it is only as an approximation. Beyond illuminating the role 
of operators as ingredients in this formalism, our point was to indicate how naturally it emerges. 
In this regard we must emphasize that the following question arises for quantum orthodoxy, but 
does not arise for Bohmian mechanics: For precisely which theory is the quantum formalism an 
idealization? 

We shall discuss how to go beyond the idealization involved in the quantum formalism in 
Section 4 — after having analyzed it thoroughly in Section 3. First we wish to show that many 
more experiments than those satisfying our assumptions can indeed be associated with operators 
in exactly the manner we have described. 

2.10 The General Framework of Bohmian Experiments 

According to ()2.19j) the statistics of the results of a discrete experiment are governed by the 
probability measure ps^^ °F^^, where pqjj.{dq) = \'^T{(l)\'^dq is the quantum equilibrium measure. 
Note that discreteness of the value space of F plays no role in the characterization of this 
measure. This suggests that we may consider a more general notion of experiment, not based 
on the assumption of a countable set of outcomes, but only on the unitarity of the operator U, 
which transforms the initial state ■?/' ® $o into the final state "^t, and on a generic calibration 
function F from the configuration space of the composite system to some value space, e.g., M, 
or R™, giving the result of the experiment as a function F{Qt) of the final configuration Qt of 
system and apparatus. We arrive in this way at the notion of general experiment 



where the unitary U embodies the interaction of system and apparatus and the function F could 
be completely general. Of course, for application to the results of real-world experiments F might 
represent the "orientation of the apparatus pointer" or some coarse-graining thereof. 

Performing (f on a system with initial wave function ip leads to the result Z = F{Qt) and since 
Qt is randomly distributed according to the quantum equilibrium measure p*j,, the probability 
distribution of Z is given by the induced measure 



<^ = {<!>o,U,F}, 



(2.34) 



Pi, = P^T ° F 



1-1 



(2.35) 
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(We have made explicit only the dependence of the measure on ip, since the initial apparatus 
state $0 is of course fixed, defined by the experiment S'.) Note that this more general notion of 
experiment eliminates the slight vagueness arising from the imprecise notion of macroscopic upon 
which the notion of discrete experiment is based. Note also that the structure ()2.34|) conveys 
information about the wave function ()2.6p of the system after a certain result F{Qt) is obtained. 

Note, however, that this somewhat formal notion of experiment may not contain enough 
information to determine the detailed Bohmian dynamics, which would require specification 
of the Hamiltonian of the system-apparatus composite, that might not be captured by U . In 
particular, the final configuration Qt may not be determined, for given initial wave function, as 
a function of the initial configuration of system and apparatus, (f does, however, determine what 
is relevant for our purposes about the random variable Qt-, namely its distribution, and hence 
that oiZ = F{Qt). 

Let us now focus on the right had side of the equation ()2.29j) . which establishes the association 
of operators with experiments: {if), Puci^) is the probability that "the operator A has value Aq,", 
and according to standard quantum mechanics the statistics of the results of measuring a general 
self-adjoint operator A, not necessarily with pure point spectrum, in the (normalized) state ip 
are described by the probability measure 

A^/i;J(A) = (^,P^(A)^) (2.36) 

where A is a (Borel) set of real numbers and : A i-^ P^(A) is the projection-valued-measure 
(PVM) uniquely associated with A by the spectral theorem. (We recall [7D] that a PVM is 
a normalized, countably additive set function whose values are, instead of nonnegative reals, 
orthogonal projections on a Hilbert space Ti. Any PVM P onTi determines, for any given ip &7i, 
a probability measure /i^ = /i^ : A P(A)ip) on M. Integration against projection- valued- 

measure is analogous to integration against ordinary measures, so that B = J f{X)P{dX) is well- 
defined, as an operator on Ti. Moreover, by the spectral theorem every self-adjoint operator A is 
of the form A = f \P{dX), for a unique projection- valued- measure P = P^, and / f{\)P{dX) = 

m. ) 

It is then rather clear how ()2.33|) extends to general self-adjoint operators: a general exper- 
iment ^ is a measurement of the self-adjoint operator A if the statistics of the results of S are 
given by \2. 3b]) . i.e., 

S ^ A if and only if = f^^ ■ (2.37) 

In particular, if S' i—>- A, then the moments of the result of ^ are the moments of A: 

< >= J V{iP,P{d\)^) = 

3 The Quantum Formalism 

The spirit of this section will be rather different from that of the previous one. Here the focus 
will be on the formal structure of experiments measuring self-adjoint operators. Our aim is to 
show that the standard quantum formalism emerges from a formal analysis of the association 
S' i-^ A between operator and experiment provided by ()2.37p . By "formal analysis" we mean not 
only that the detailed physical conditions under which might S' A hold (e.g., reproducibility) 
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will play no role, but also that the practical requirement that S be physically realizable will be 
of no relevance whatsoever. 

Note that such a formal approach is unavoidable in order to recover the quantum formalism. 
In fact, within the quantum formalism one may consider measurements of arbitrary self-adjoint 
operators, for example, the operator A = X^P + PX^, where X and P are respectively the 
position and the momentum operators. However, it may very well be the case that no "real 
world" experiment measuring A exists. Thus, in order to allow for measurements of arbitrary self- 
adjoint operators we shall regard ()2.H4j) as characterizing an abstract experiment" ; in particular, 
we shall not regard the unitary map U as arising necessarily from a (realizable) Schrodinger time 
evolution. We may also speak of virtual experiments. 

In this regard one should observe that to resort to a formal analysis is indeed quite common 
in physics. Consider, e.g., the Hamiltonian formulation of classical mechanics that arose from an 
abstraction of the physical description of the world provided by Newtonian mechanics. Here we 
may freely speak of completely general Hamiltonians, e.g. H{p, q) = p^, without being concerned 
about whether they are physical or not. Indeed, only very few Hamiltonians correspond to 
physically realizable motions! 

A warning: As we have stressed in the introduction and in Section 12.91 when we speak here 
of a measurement we don't usually mean a genuine measurement — an experiment revealing the 
pre-existing value of a quantity of interest, the measured quantity or property. (We speak in this 
unfortunate way because it is standard.) Genuine measurement will be discussed much later, in 
Section [7| 

3.1 Weak Formal Measurements 

The first formal notion we shall consider is that of weak formal measurement, formalizing the 
relevant data of an experiment measuring a self-adjoint operator: 

Any orthogonal decomposition H = ^^Ha, i-e., any complete collection {Ha} of 
mutually orthogonal subspaces, paired with any set {\a} of distinct real numbers, (3.1) 
defines the weak formal measurement Ai = {{Ha, Aq,)} = {Ha, Aq,}. 

(Compare ()3.1|) with ()2.32|) and note that now we are not assuming that the spaces Ha are 
finite-dimensional.) The notion of weak formal measurement is aimed at expressing the minimal 
structure that all experiments (some or all of which might be virtual) measuring the same operator 
A = XaP-Ha have in common {Phc is the orthogonal projection onto the subspace Ha)- Then, 
"to perform Ai" shall mean to perform (at least virtually) any one of these experiments, i.e., 
any experiment such that 

Pa = {^P,Pn^^) (3.2) 

is the probability of obtaining the result Aq, on a system initially in the state ip. (This is of course 
equivalent to requiring that the result Aq, is definitely obtained if and only if the initial wave 
function ip e Ha-) 

Given M. = {Ha, Aq} consider the set function 

P: A^P(A)= J2 Pn., (3.3) 
where A is a set of real numbers (technically, a Borel set). Then 
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1) P is normalized, i.e., P(M) = /, where / is the identity operator and R is the real hne, 

2) P(A) is an orthogonal projection, i.e., P(A)^ = P(A) = P(A)*, 

3) P is countably additive, i.e., P(|J„ A„) = Xln, -P(^n)> for A„ disjoint sets. 

Thus P is a projection-valued-measure and therefore the notion of weak formal measurement is 
indeed equivalent to that of "discrete" PVM, that is, a PVM supported by a countable set {Xa} 
of values. 

More general PVMs, e.g. PVMs supported by a continuous set of values, will arise if we 
extend ()3.1|) and base the notion of weak formal measurement upon the general association 
()2.37|) between experiments and operators. If we stipulate that 

any projection-valued-measure P onTi defines a weak formal measurement M. = P, (3.4) 

then "to perform A^" shall mean to perform any experiment S associated with A = J XP{dX) 
in the sense of fl2.37|l . 

Note that since by the spectral theorem there is a natural one-to-one correspondence between 
PVMs and self-adjoint operators, we may speak equivalently of the operator A = Aj^, for given 
A^, or of the weak formal = A^yi, for given A. In particular, the weak formal measurement 
M.A represents the equivalence class of all experiments <§ ^ A. 

3.2 Strong Formal Measurements 

We wish now to classify the different experiments S associated with the same self-adjoint operator 
A by taking into account the effect of S on the state of the system, i.e., the state transformations 
ijj ^ ipa induced by the occurrence of the various results of S'. Accordingly, unless otherwise 
stated, from now on we shall assume to be a discrete experiment measuring A = XaP-Ha i 
for which the state transformation if) ^ if) ^ is defined by fl2.18|) . This leads to the notion of 
strong formal measurements. For the most important types of strong formal measurements, 
ideal, normal and standard, there is a one-to-one correspondence between a's and numerical 
results Xa- 

3.2.1 Ideal Measurements 

Given a weak formal measurement of A, the simplest possibility for the transition ■?/'—>■'?/'„ is 
that when the result is obtained, the initial state ip is projected onto the corresponding space 
Tiai that 

^ ^ V'a = (3.5) 

This prescription defines uniquely the ideal measurement of A. (The transformation ip ^ ipa 
should be regarded as defined only in the projective sense: ip ^ ipa and ip cipa (c 7^ 0) should 
be regarded as the same transition.) "To perform an ideal measurement of A" shall then mean 
to perform a discrete experiment S' whose results are statistically distributed according to ()3.2|) 
and whose state transformations ()2.18|) are given by ()3.5|) . 

Under an ideal measurement the wave function changes as little as possible: an initial ip G Tia 
is unchanged by the measurement. Ideal measurements have always played a privileged role in 
quantum mechanics. It is the ideal measurements that are most frequently discussed in textbooks. 
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It is for ideal measurements that the standard collapse rule is obeyed. When Dirac wrote: 
"a measurement always causes the system to jump into an eigenstate of the dynamical variable 
that is being measured" he was referring to an ideal measurement. 

3.2.2 Normal Measurements 

The rigid structure of ideal measurements can be weakened by requiring only that Tia as a whole, 
and not the individual vectors in Tia, is unchanged by the measurement and therefore that the 
state transformations induced by the measurement are such that when the result Aq is obtained 
the transition 

^|J^^P^ = U^Ph^^ (3.6) 

occurs, where the Ua are operators on Tia ( Ua '■ 'Ha — 'Ha)- Then for any such discrete 
experiment S measuring A, the Ua can be chosen so that ()3.6|) agrees with ()2.18|) . i.e., so that for 
"0 e 'Ha-i U{%Ij® $o) = Ua^J ® and hence so that Ua is unitary (or at least a partial isometry). 
Such a measurement, with unitaries Ua '■ 'Ha Ha-, will be called a normal measurement of A. 

In contrast with an ideal measurement, a normal measurement of an operator is not uniquely 
determined by the operator itself: additional information is needed to determine the transitions, 
and this is provided by the family {Ua\- Different families define different normal measurements 
of the same operator. Note that ideal measurements are, of course, normal (with Ua = la = iden- 
tity on 'Ha)) and that normal measurements with one-dimensional subspaces H.a are necessarily 
ideal. 

Since the transformations (|3.6|) leave invariant the subspaces TYq, the notion of normal mea- 
surement characterizes completely the class of reproducible measurements of self-adjoint oper- 
ators. Following the terminology introduced by Pauli [^1], normal measurement are sometimes 
called measurements of first kind. Normal measurements are also quantum non demolition ( QND) 
measurements defined as measurements such that the operators describing the induced 
state transformations, i.e, the operators Ra = UaPua^ commute with the measured operator 
^4 = ^ XaP-Hc- (This condition is regarded as expressing that the measurement leaves the mea- 
sured observable A unperturbed). 

3.2.3 Standard Measurements 

We may now drop the condition that the 'Ha are left invariant by the measurement and consider 
the very general state transformations 

^ = TaPn^i^ (3.7) 

with operators Ta : Ha —>■ 'H. Then, exactly as for the case of normal measurements^it follows 
that Ta can be chosen to be unitary from H.a onto its range 'Ha- The subspaces 'Ha need be 
neither orthogonal nor distinct. We shall write Ra = TaP-Ha general transition operators. 

With Ta as chosen, Ra is characterized by the equation R*^Ra = Phc (where i?* denotes the 
adjoint of Ra)- 

The state transformations ()3.7|) . given by unitaries Ta : 'Ha — * Ha, or equivalently by bounded 
operators Ra on Ti satisfying R^Ra = Phci define what we shall call a standard measurement of 
A. Note that normal measurements are standard measurements with Tia = 'Ha (or H.a C Ha)- 
Although standard measurements are in a sense more realistic than normal measurements (real 
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world measurements are seldom reproducible in a strict sense), they are very rarely discussed in 
textbooks. We emphasize that the crucial data in a standard measurement is given by i?^, which 
governs both the state transformations {ip — >• Raip) and the probabilities (p^ = P-h ip) = 

\\Ram- 

We shall illustrate the main features of standard measurements by considering a very simple 
example: Let {cq, ei, 62, . . .}, be a fixed orthonormal basis of Ti. and consider the standard mea- 
surement whose results are the numbers 0, 1, 2, . . . and whose state transformations are defined 
by the operators 

Ra=\eo){ea\ i-e., Raip ^ {ea,ip)eo, a = 0,1,2,... 

With such RaS are associated the projections Pa — R^Ra = |ea)(ea| , i.e., the projections 
onto the one dimensional spaces Ha spanned respectively by the vectors e^. Thus, this is a 
measurement of the operator A = XIq Note that the spaces Ti^, i.e. the ranges of the 
i?Q,'s, are all the same and equal to the space Hq generated by the vector Cq. The measurement 
is then not normal since Tia 7^ Ti-a- Finally, note that this measurement could be regarded as 
giving a simple model for a photo detection experiment, where any state is projected onto the 
"vacuum state" Cq after the detection. 

3.2.4 Strong Formal Measurements 

We shall now relax the condition that a 1— > Aq is one-to-one, as we would have to do for an 
experiment having a general calibration a 1-^ Aq,, which need not be invertible. This leads to 
(what we shall call) a strong formal measurement. Since this notion provides the most general 
formalization of the notion of a "measurement of a self-adjoint operator" that takes into account 
the effect of the measurement on the state of the system, we shall spell it out precisely as follows: 

Any complete (labelled) collection {Tia} of mutually orthogonal subspaces, any (la- 
belled) set {Aq,} of not necessarily distinct real numbers, and any (labelled) collec- 
tion {Ra} of bounded operators on H, such that R*aRa = (^^^ projection onto 
'Ha), defines a strong formal measurement. 

A strong formal measurement will be compactly denoted by = {(TYq,, A^, /?«)} = 
{Ha,^a)Ra}-i Or eveu more compactly by = {\aiRa} (the spaces Ha can be extracted 
from the projections Pn„ — R*^Ra)- With M. is associated the operator A — Yl/^aPua- Note 
that since the Aq, are not necessarily distinct numbers, P-^^ need not be the spectral projection 
P^{\a) associated with Aq,; in general 

P\\) = Pn.^ 

i.e., it is the sum of all the P-^^'s that are associated with the value A.-*^^ "Tb perform the 
measurement A4" on a system initially in ip shall accordingly mean to perform a discrete ex- 
periment S" such that: 1) the probability p(A) of getting the result A is governed by A, i.e., 

"'^^It is for this reason that it would be pointless and inappropriate to similarly generalize weak measurements. 
It is only when the state transformation is taken into account that the distinction between the outcome a (which 
determines the transformation) and the result Aq, (whose probability the formal measurement is to supply) becomes 
relevant. 
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p{X) = {ip, P^{\)i') , and 2) the state transformations of are those prescribed by A^, i.e., 

Observe that strong formal measurements do provide a more reahstic formahzation of the 
notion of measurement of an operator than standard measurements: the notion of discrete ex- 
periment does not imply a one-to-one correspondence between outcomes, i.e, final macroscopic 
configurations of the pointer, and the numerical results of the experiment. 

The relationship between (weak or strong) formal measurements, self-adjoint operators, and 
experiments can be summarized by the following sequence of maps: 

A (3.9) 

The first map expresses that Ai (weak or strong) is a formalization of <f — it contains the "relevant 
data" about — and it will be many-to-one if is a weak formal measurement^^; the second 
map expresses that is a formal measurement of A and it will be many-to-one if Ai is (required 
to be) strong and one-to-one if Ai is weak. Note that ^ A is always many-to-one. 

3.3 From Formal Measurements to Experiments 

Given a strong measurement Al = {Ha, ^a,Ra} one may easily construct a map (j2.18p defining 
a discrete experiment = S'm associated with Ai: 

U : i)®^o^Y^{Rai))(^^a (3.10) 

a 

The unitarity of U ( from 7i ® $o onto the range of U) follows then immediately from the 
orthonormality of the since 

\\RM' = J2{i^,R:Ra^) = (^,5^Ph.^) = (^,^) = \m' (s.n) 

a a a 

This experiment is abstractly characterized by: 1) the finite or countable set / of outcomes a, 
2) the apparatus ready state $0 and the set of normalized apparatus states, 3) the unitary 

map U : 7i ® $0 — ® given by ()3.10p . 4) the cahbration a t— > Aq assigning numerical 
values (or a vector of such values) to the various outcomes a. Note that U need not arise from 
a Schrodinger Hamiltonian governing the interaction between system and apparatus. Thus ff 
should properly be regarded as an "abstract" experiment as we have already pointed out in the 
introduction to this section. 

3.4 Von Neumann Measurements 

We shall now briefly comment on the relation between our approach, based on formal measure- 
ments, and the widely used formulation of quantum measurement in terms of von Neumann 
measurements jTl] . 

A von Neumann measurement of A = ^aPua on a system initially in the state ip can be 
described as follows (while the nondegeneracy of the eigenvalues of A — i.e., that dim(?ia) = 1 — 
is usually assumed, we shall not do so): Assume that the (relevant) configuration space of the 

There is an obvious natural unitary equivalence between the preimages to" of a strong formal measurement 

M. 
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apparatus, whose generic configuration shall be denoted by y, is one-dimensional, so that its 
Hilbert space ?^_4 ~ L^(]R), and that the interaction between system and apparatus is governed 
by the Hamiltonian 

H = H,n = -fA(dPy (3.12) 

where Py = ihd/dy is (minus) the momentum operator of the apparatus. Let $o = ^o{y) be 
the ready state of the apparatus. Then for ip = Puai^ one easily sees that the unitary operator 
U = e"*^^/'^ transforms the initial state tpa ® '^'o into ipa ® where = ^o{y — Xa'jT), so that 
the action of U on general ip = P'Ha'^ is 

U: ^ ® $0 ^ ® (3.13) 

a 

If $0 has sufficiently narrow support, say around ?/ = 0, the will have disjoint support 
around the "pointer positions" ya = ^alT, and thus will be orthogonal, so that, with calibration 
F{y) = y/'jT (more precisely, F{y) = ya/lT for y in the support of $„), the resulting von 
Neumann measurement becomes a discrete experiment measuring A\ comparing ()3.13p and ()3.5|) 
we see that it is an ideal measurement of A}'^ 

Thus, the framework of von Neumann measurements is less general than that of discrete 
experiments, or equivalently of strong formal measurements; at the same time, since the Hamil- 
tonian ifvN is not of Schrodinger type, von Neumann measurements are just as formal. (We note 
that more general von Neumann measurements of A can be obtained by replacing H^^ with more 
general Hamiltonians; for example, H'^^ = Hq + -ffvN, where Hq is a self-adjoint operator on the 
system Hilbert space which commutes with A, gives rise to a normal measurement of A, with 

= e-^^-f^»/^Pw^.Thus by proper extension of the von Neumann measurements one may arrive 
at a framework of measurements completely equivalent to that of strong formal measurements.) 

3.5 Preparation Procedures 

Before discussing further extensions of the association between experiments and operators, we 
shall comment on an implicit assumption apparently required for the measurement analysis to 
be relevant: that the system upon which measurements are to be performed can be prepared in 
any prescribed state ip. 

Firstly, we observe that the system can be prepared in a prescribed state ip by means of 
an appropriate standard measurement Ai performed on the system when it is initially in an 
unknown state We have to choose M. = {Ha,Xa,Ra} in such a way that Raoi^' = i^, for 
some ao and all ip', i.e., that Ran(i?aQ) = span(?/^); then from reading the result Xao we may 
infer that the system has collapsed to the state ip. The simplest possibility is for Ai to be an 
ideal measurement with at least a one- dimensional subspace Tiag that is spanned by ip. Another 
possibility is to perform a (nonideal) standard measurement like that of the example at the end 
of Section 13.2.31 which can be regarded as defining a preparation procedure for the state Cq. 

Secondly, we wish to emphasize that the existence of preparation procedures is not as crucial 
for relevance as it may seem. If we had only statistical knowledge about the initial state ip, 
nothing would change in our analysis of Bohmian experiments of Section 2, and in our conclusions 

^''it is usually required that von Neumann measurements be impulsive (7 large, T small) so that only the 
interaction term H3.12|l contributes significantly to the total Hamiltonian over the course of the measurement. 
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concerning the emergence of self-adjoint operators, except that the uncertainty about the final 
configuration of the pointer would originate from both quantum equilibrium and randomness in 
We shall elaborate upon this later when we discuss Bohmian experiments for initial states 
described by a density matrix. 

3.6 Measurements of Commuting Families of Operators 

As hinted in Section |2IZl the result of an experiment <f might be more complex than we have sug- 
gested until now in Section 3: it might be given by the vector Aq = {Xa \ ■ ■ ■ , XcT^) corresponding 
to the orientations of m pointers. For example, the apparatus itself may be a composite of m 
devices with the possible results Xa^ corresponding to the final state of the i-th device. Nothing 
much will change in our discussion of measurements if we now replace the numbers Aq, with 
the vectors Aq, = {Xa\ ■ ■ ■ , A^™'*), since the dimension of the value space was not very relevant. 
However will now be associated, not with a single self-adjoint operator, but with a commuting 
family of such operators. In other words, we arrive at the notion of an experiment ^ that is a 
measurement of a commuting family of self-adjoint operators, namely the family 



Then the notions of the various kinds of formal measurements — weak, ideal, normal, standard, 
strong — extend straightforwardly to formal measurements of commuting families of operators. In 
particular, for the general notion of weak formal measurement given bv l3.41 P becomes a PVM on 
with associated operators At = X'^^P{dX) [A = (A^^), . . . , A^")) G M*"]. And just as for 
PVMs on M and self-adjoint operators, this association in fact yields, by the spectral theorem, 
a one-to-one correspondence between PVMs on M™ and commuting families of m self-adjoint 
operators. The PVM corresponding to the commuting family {Ai, . . . , Am) is in fact simply the 
product PVM P = P^ = X ■ ■ ■ X P^™ given on product sets by 



where P^^, . . . , P^™ are the PVMs of Ai, . . . , Am, and Aj C M, with the associated probability 
distributions on given by the spectral measures for A 



for any (Borel) set A C M"". 

i8"\Ye recall some basic facts about commuting families of self-adjoint operators |74[ I71L 1^ . The self-adjoint 
operators Ai, . . . ,Am form a commuting family if they are bounded and pairwise commute, or, more generally, 
if this is so for their spectral projections, i.e., if [P^''{A), P^^ (T)] = for all i,j = 1, . . . ,m and (Borel) sets 
A, r C M. A commuting family A = (Ai, . . . , Am) of self-adjoint operators is called complete if every self-adjoint 
operator C that commutes with all members of the family can be expressed as C = g{A\^ A2, . . . ) for some function 
g. The set of all such operators cannot be extended in any suitable sense (it is closed in all relevant operator 
topologies). For any commuting family (Ai, . . . , Am) of self-adjoint operators there is a self-adjoint operator B 
and measurable functions fi such that Ai = fi{B). If the family is complete, then this operator has simple (i.e., 
nondegenerate) spectrum. 




(3.14) 




(3.15) 



4iA) = {^,P\A)^) 



(3.16) 
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In particular, for a PVM on M"^, corresponding to A = {Ai, . . . , Am), the i-marginal distri- 
bution, i.e., the distribution of the i-th component A^*\ is 



the spectral measure for A^. Thus, by focusing on the respective pointer variables X^^\ we 
may regard an experiment measuring (or a weak formal measurement of) A = {Ai, . . . , A^) as 
providing an experiment measuring (or a weak formal measurement of) each Ai, just as would 
be the case for a genuine measurement of m quantities Ai, ... ,Am- Note also the following: 
If {Ha-i^aiRa} IS a stroug formal measurement oi A = {Ai, . . . , Am), then {Ha,Xa\Ra} is a 
strong formal measurement of Ai, but if {Ha,Xa,Ra} is an ideal, resp. normal, resp. standard, 
measurement of A, {Ha, Xa\ Ra} need not be ideal, resp. normal, resp. standard. 

There is a crucial point to observe: the same operator may belong to different commuting fam- 
ilies. Consider, for example, a measurement of A = [Ai, . . . , Am) and one of -B = {Bi, . . . , Bm), 
where Ai = Bi = C. Then while both measurements provide a measurement of C, they could be 
totally different: the operators A^ and Bi for ^ 7^ 1 need not commute and the PVMs of A and 
B, as well as any corresponding experiments a and £"3, will be in general essentially different. 

To emphasize this point we shall recall a famous example, the EPRB experiment \62\ I14j: A 
pair of spin one-half particles, prepared in a spin-singlet state 



are moving freely in opposite directions. Measurements are made, say by Stern-Gerlach magnets, 
on selected components of the spins of the two particles. Let a, b, c be three different unit 
vectors in space, let cri = a ® I and let cr2 = I ® cr, where a = {ax, (Jy, Cz) are the Pauli 
matrices. Then we could measure the operator cri-a by measuring either of the commuting 
families (cri-a, cr2-b) and (cri-a, cr2-c). However these measurements are different, both as weak 
and as strong measurements, and of course as experiments. In Bohmian mechanics the result 
obtained at one place at any given time will in fact depend upon the choice of the measurement 
simultaneously performed at the other place (i.e., on whether the spin of the other particle is 
measured along b or along c). However, the statistics of the results won't be affected by the 
choice of measurement at the other place because both choices yield measurements of the same 
operator and thus their results must have the same statistical distribution. 

3.7 Functions of Measurements 

One of the most common experimental procedures is to recalibrate the scale of an experiment S': 
if Z is the original result and / an appropriate function, recalibration by / leads to f{Z) as the 
new result. Thus f{(^) has an obvious meaning. Moreover, if t— > A according to ()2.37|) then 



fi^iR X ■ ■ - M X A, X M X ■ ■ ■ X M) = (7/^, P^^{Ai)^) = fif{Ai), 



\ and 



4or\d\) = {^ij,p\f-\d\))i;) = (^,p/(^)(dA)^) 



where the last equality follows from the very definition of 
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provided by the spectral theorem. Thus, 



then 



f{Z) 



J{A) 



(3.17) 



I.e., 



if 



then 



(3.18) 



The notion of function of a formal measurement has then an unequivocal meaning: if }A 
is a weak formal measurement defined by the PVM P then f{M.) is the weak formal mea- 
surement defined by the PVM P o /^^, so that if is a measurement of A then f{Ai) is a 
measurement of f{A); for a strong formal measurement Ai — {Ti-ai^aiRa} the self-evident re- 
quirement that the recalibration not affect the wave function transitions induced by M. leads to 
f{.M) = {Ha, fi^a), Ra}- Note that if is a standard measurement, f{M) will in general not 
be standard (since in general / can be many-to~one) . 

To highhght some subtleties of the notion of function of measurement we shall discuss two 

examples: Suppose that Ai and Ai' are respectively measurements of the commuting families 
A = (^1,^2) and B = (^i.^s), with A1A2 = B1B2 = C. Let / : ^ M, /(Ai,A2) = A1A2. 
Then both f{A4) and f{Ai') are measurement of the same self-adjoint operator C. Nevertheless, 
as strong measurements or as experiments, they could be very different: if A2 and B2 do not 
commute they will be associated with different families of spectral projections. (Even more 
simply, consider measurements Ai^ and Aiy of a^: and cTy and let /(A) = A^. Then f{A4x) and 
f{Aiy) are measurement of / — so that the result must be 1) — but the two strong measurements, 
as well as the corresponding experiments, are completely different.) 

The second example is provided by measurements designed to determine whether the operator 
A — ^XaPua (the Aq's are distinct) has values in some given set A. This determination can 
be accomplished in at least two different ways: Suppose that A4 is an ideal measurement of A 
and let 1a(A) be the characteristic function of the set A. Then we could perform Ia(^), that 
is, we measure A and see whether "yl e A" . But we could also perform an 'Hdeal determination 
oi A e A", that is, an ideal measurement of Ia(^) = P^(A). Now, both measurements provide 
a "measurement of ^4 e A" (i.e., of the operator 1a(^)), since in both cases the results 1 and 
get assigned the same probabilities. However, as strong measurements, they are different: when 
^a{A4) is performed, and the result 1 is obtained, ip undergoes the transition 



where a is the outcome with A^ G A that actually occurs. On the other hand, for an ideal 
measurement of 1a (^), the occurrence of the result 1 will generate the transition 



Note that in this case the state of the system is changed as little as possible. For example, 
suppose that two eigenvalues, say A^^, A^j, belong to A and ip = ipai + ipa2') then determination 
by performing 1a (-^) will lead to either -0^^ or -002) while the ideal determination of ^4 e A will 
not change the state. 



^ ^ P^{A)^ = 5] P^>. 



AaSA 
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3.8 Measurements of Operators with Continuous Spectrum 

We shall now reconsider the status of measurements of self-adjoint operators with continuous 
spectrum. First of all, we remark that while on the weak level such measurements arise very 
naturally — and, as already stressed in Section 13.11 are indeed the first to appear in Bohmian 
mechanics — there is no straightforward extension of the notion of strong measurement to opera- 
tors with continuous spectrum. 

However, for given set of real numbers A, one may consider any determination of A G A, that 
is, any strong measurement of the spectral projection P'^(A). More generally, for any choice of 
a simple function 

N 

/(A) = J]caA.(A), 
j=l 

one may consider the strong measurements of f{A). In particular, let {/*•"'•'} be a sequence of 
simple functions converging to the identity, so that (A) — > A, and let Ain be measurements 
of f^"'\A). Then Ain are approximate measurements of A. 

Observe that the foregoing applies to operators with discrete spectrum, as well as to operators 
with continuous spectrum. But note that while on the weak level we always have 

where is a (general) weak measurement of A (in the sense of fl3.4j) ). if A has continuous 
spectrum Ai will not exist as a strong measurement (in any reasonable generalized sense, since 
this would imply the existence of a bounded-operator-valued function Rx on the spectrum of A 
such that R\RxdX = P^{dX), which is clearly impossible). In other words, in this case there 
can be no actual (generalized) strong measurement that the approximate measurements Ain 
approximate — which is perfectly reasonable. 

3.9 Sequential Measurements 

Suppose that n measurements (with for each i, the Xa] distinct) 
of operators (which need not commute) 

^1 ^ai ^ai ; • • • ! ^a,, 

are successively performed on our system at times 0<ti<t2<---<^Af- Assume that the 
duration of any single measurement is small with respect to the time differences ti — tj-i, so that 
the measurements can be regarded as instantaneous. If in between two successive measurements 
the system's wave function changes unitarily with the operators Ut then, using obvious notation, 

Prob^(^ = A«, ...,An = AS) = ||PS(t.) ■ ■ ■ i?«(ti) i^r, (3.19) 
where R^a]{t) = U^^R^a]Ut and i> is the initial {t = 0) wave function. 
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To understand how ()3.19|) comes about consider first the case where n = 2 and ^2 ~ ~ 0. 
According to standard probabihty rules, the probabihty of obtaining the results Zi = Xa^ for the 
first measurement and Z2 = Xa2 for the second one is the product^^ 

Prob^(Z2 = A£)|Zi = A«) • Prob^Zi = A«) 

(2) 

where the first term is the probability of obtaining given that the result of the first measure- 
ment is Xai- Since Aii then transforms the wave function i/j to Ra^ijj, the (normalized) initial 
wave function for is Ra^ip/WRaiipW, this probability is equal to 

||-Ro2^-RiiVlP 

The second term, the probability of obtaining Xai , is of course 

ll/^iWf ■ Thus 

Prob,(A« = A«, A(^) = Ag) = Wnm^H' 

in this case. Note that, in agreement with the analysis of discrete experiments (see Eq. ()2.20p ). 
the probability of obtaining the results Xai ^^'^ turns out to be the square of the norm of the 
final system wave function associated with these results. Now, for general times ti and ^2 ~ ^1 
between the preparation of at t = and the performance of A^i and between Aii and 
respectively, the final system wave function is R^alUt^-tMlUt.i' = R^alUt^U^-^R^aiUt,^. But 
\\Ra2Ut2Uf~^ Ra^ Ut^ipW = \\Ut~^ Ra2Ut2U^^ Ra^Utj^ipW , and it is easy to see, just as for the simple 
case just considered, that the square of the latter is the probability for the corresponding result, 
whence fl3.19|) for n = 2. Iterating, i.e., by induction, we arrive at ()3.19|) for general n. 

We note that when the measurements A^i, . . . A4n are ideal, the operators Ra} are the orthog- 
onal projections Pi*\ and equation ()3.19|) becomes the standard formula for the joint probabilities 
of the results of a sequence of measurements of quantum observables, usually known as Wigner's 
formula [7E] . 

It is important to observe that, even for ideal measurements, the joint probabilities 
given by ()3.19j) are not in general a consistent family of joint distributions: summation 
in fl3.19|) over the outcomes of the z-th measurement does not yield the joint probabil- 
ities for the results of the measurements of the operators Ai, . . . , Ai^i, Ai+i, . . . An per- 
formed at the times ti, . . . , tj+i, . . . t„. (By rewriting the right hand side of ()3.19|) as 
{^l)^Ra}{tn)* ■ ■ ■ Ra'l{tn)*Ra'}{tn)Ral{ti)%l)) oue easily sees that the "sum rule" will be satisfied 
when i = n or if the operators Ra}{ti) commute. More generally, the consistency is guaranteed 
by the "decoherence conditions" of Griffiths, Omnes, Gell-Mann and Hartle, and Goldstein and 
Page HElinSlllll. 

"'^^This is so because of the conditional independence of the outcomes of two successive measurements given 
the final conditional wave function for the first measurement. More generally, the outcome of any measurement 
depends only on the wave function resulting from the preceding one. For Bohmian experiments this independence 
is a direct consequence of (|2.10|l . One may wonder about the status of this independence for orthodox quantum 
theory. We stress that while this issue might be problematical for orthodox quantum theory, it is not a problem 
for Bohmian mechanics: the conditional independence of two successive measurements is a consequence of the 
theory. (For more on this point, see We also would like to stress that this independence assumption is in 

fact crucial for orthodox quantum theory. Without it, it is hard to see how one could ever be justified in invoking 
the quantum formalism. Any measurement we may consider will follow many earlier measurements. 
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This failure of consistency means that the marginals of the joint probabilities given by ()3.19|) 
are not themselves given by the corresponding case of the formula. This should, however, come as 
no surprise: Since performing the measurement A4i affects the state of the system, the outcome 
of Aii+i should in general depend on whether or not Aii has been performed. Note that there 
is nothing particularly quantum in the fact that measurements matter in this way: They matter 
even for genuine measurements (unlike those we have been considering, in which nothing need 
be genuinely measured), and even in classical physics, if the measurements are such that they 
affect the state of the system. 

The sequences of results Aq, = (AiV , . . . , aL"^), the associated state transformations Ra = 
Ra]Ut„^t^^,Rt~-i ■ ■ ■ Ra^Ut,, and the probabilities 

(i.e., given hj Pa = ||-Rq,|P) define what 
we shall call a sequential measurement of Aii, ■ ■ ■ A^„, which we shall denote by (g) . . . (g) A^i. 
A sequential measurement does not in general define a formal measurement, neither weak nor 
strong, since R^Ra need not be a projection. This fact might seem disturbing (see, e.g., [21]): 
we shall take up this issue in the next section. 

3.10 Some Summarizing Remarks 

The notion of formal measurement we have explored in this section is at the heart of the quantum 
formalism. It embodies the two essential ingredients of a quantum measurement: the self-adjoint 
operator A which represents the measured observable and the set of state transformations Ra 
associated with the measured results. The operator always carries the information about the 
statistics of possible results. The state transformations prescribe how the state of the system 
changes when the measurement is performed. For ideal measurement the latter information is 
also provided by the operator, but in general additional structure (the -Ra's) is required. 

There are some important morals to draw. The association between measurements and oper- 
ators is many-to-one: the same operator A can be measured by many different measurements, 
for example ideal, or normal but not ideal. Among the possible measurements of A, we must 
consider all possible measurements of commuting families of operators that include A, each of 
which may correspond to entirely different experimental setups. 

A related fact: not all measurements are ideal measurements."^^ No argument, physical or 
mathematical, suggests that ideal measurements should be regarded as "more correct" than any 

^''In this regard we observe that the vague behef in a universal collapse rule is as old, almost, as quantum 
mechanics. It is reflected in von Neumann's formulation of quantum mechanics |74| . based on two distinct dy- 
namical laws: a unitary evolution between measurements, and a nonunitary evolution when measurements are 
performed. However, von Neumann's original proposal |74| for the nonunitary evolution — that when a measure- 
ment oi A — ^aPrta is performed upon a system in the state given by the density matrix W , the state of the 
system after the measurement is represented by the density matrix 

a 13 

where, for each a, {4>ap} is a basis for Ha — does not treat the general measurement as ideal. Moreover, this 
expression in general depends on the choice of the basis {4>ap}, and was thus criticized by Liiders [SJ, who 
proposed the transformation 

W ^W' = P-h^WPh^ , 

a 

as it gives a unique prescription. Note that for W = P[^;] , where is the projection onto the initial pure state 
■!/;, W' = 'Y^a PctPytpa] ' where — \ {ip, Pnai^) P ^nd — PHai^j corresponding to an ideal measurement. 
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other type. In particular, the Wigner formula for the statistics of a sequence of ideal measure- 
ments is no more correct than the formula (j3.19j) for a sequence of more general measurement. 
Granting a privileged status to ideal measurements amounts to a drastic and arbitrary restric- 
tion on the quantum formalism qua measurement formalism, since many (in fact most) real world 
measurements would be left out. 

In this regard we note that the arbitrary restriction to ideal measurements affects the research 
program of "decoherent" or "consistent" histories [SHI ESI ESI, since Wigner's formula for a se- 
quence of ideal measurements is unquestionably at its basis. (It should be emphasized however 
that the special status granted to ideal measurements is probably not the main difficulty with 
this approach. The no-hidden-variables theorems, which we shall discuss in Section 7, show that 
the totality of different families of weakly decohering histories, with their respective probability 
formulas, is genuinely inconsistent. While such inconsistency is perfectly acceptable for a mea- 
surement formalism, it is hard to see how it can be tolerated as the basis of what is claimed to 
be a fundamental theory. For more on this, see fIF)\ |33] ■ 

4 The Extended Quantum Formalism 

As indicated in Section 2.9, the textbook quantum formalism is merely an idealization. As 
just stressed, not all real world measurements are ideal. In fact, in the real world the projection 
postulate — that when the measurement of an observable yields a specific value, the wave function 
of the system is replaced by its projection onto the corresponding eigenspace — is rarely obeyed. 
More importantly, a great many significant real- world experiments are simply not at all associated 
with operators in the usual way. Consider for example an electron with fairly general initial wave 
function, and surround the electron with a "photographic" plate, away from (the support of the 
wave function of) the electron, but not too far away. This setup measures the position of "escape" 
of the electron from the region surrounded by the plate. Notice that since in general the time of 
escape is random, it is not at all clear which operator should correspond to the escape position — 
it should not be the Heisenberg position operator at a specific time, and a Heisenberg position 
operator at a random time has no meaning. In fact, there is presumably no such operator, so that 
for the experiment just described the probabilities for the possible results cannot be expressed 
in the form ()2.37p . and in fact are not given by the spectral measure for any operator. 

Time measurements, for example escape times or decay times, are particularly embarrassing 
for the quantum formalism. This subject remains mired in controversy, with various research 
groups proposing their own favorite candidates for the "time operator" while paying little at- 
tention to the proposals of the other groups. For an analysis of time measurements within the 
framework of Bohmian mechanics, see [201; ^^is regard see also 13^ 14 7j. 

Because of these and other difficulties, it has been proposed that we should go beyond 
operators-as-observables, to ^^generalized observables," described by mathematical objects even 
more abstract than operators (see, e.g., the books of Davies j^I], Holevo and Kraus jSHI)- The 
basis of this generalization lies in the observation that, by the spectral theorem, the concept of 
self-adjoint operator is completely equivalent to that of (a normalized) projection- valued measure 
(PVM), an orthogonal-projection- valued additive set function, on the value space M. Orthogonal 
projections are among the simplest examples of positive operators, and a natural generalization 
of a "quantum observable" is provided by a positive-operator- valued measure (POVM): a nor- 
malized, countably additive set function O whose values are positive operators on a Hilbert space 
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Ti. When a POVM is sandwiched by a wave function it generates a probabihty distribution 

/ij: A^/iJ(A) = (^,0(A)^) (4.1) 
in exactly the same manner as a PVM. 

4.1 POVMs and Bohmian Experiments 

From a fundamental perspective, it may seem that we would regard this generalization, to 
positive-operator-valued measures, as a step in the wrong direction, since it supplies us with 
a new, much larger class of fundamentally unneeded abstract mathematical entities far removed 
from the basic ingredients of Bohmian mechanics. However from the perspective of Bohmian 
phenomenology positive-operator-valued measures form an extremely natural class of objects — 
indeed more natural than projection-valued measures. 

To see how this comes about observe that ()2.18p defines a family of bounded linear operators 
Ra by 

%J [U{^ ® $o)] = {Ra^) ® ^o., (4.2) 

in terms of which we may rewrite the probability ()2.20|) of obtaining the result (distinct) in a 
generic discrete experiment as 

p« = = WRM? = {^,R*aRc.i^) ■ (4.3) 

By the unitarity of the overall evolution of system and apparatus we have that X^all'^oP ~ 
'^ai'^, Rl,Rai') = 1 for all ip eH, whence 

J2KRa = I. (4.4) 

a 

The operators Oa = R^Ra are obviously positive, i.e., 

{iP, Oa^) > for all ^eH (4.5) 
and by (j4.4j) sum up to the identity, 

J20o. = I. (4.6) 

a 

Thus we may associate with a generic discrete experiment — with no assumptions about repro- 
ducibility or anything else, but merely unitarity — a POVM 

0(A) = ^ 0„ = ^ KRa, (4.7) 

in terms of which the statistics of the results can be expressed in a compact way: the probability 
that the result of the experiment lies in a set A is given by 

Moreover, it follows from (|2.18p and (|4.2|) that S' generates state transformations 

ij^iP^ = R^i:. (4.9) 
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4.2 Formal Experiments 



The association between experiments and POVMs can be extended to a general experiment ()2.34|1 
in a straightforward way. In analogy with ()2.37|) we shall say that the POVM O is associated 
with the experiment whenever the probability distribution ()2.35|) of the results of S is equal 
to the probability measure ()4.1|) generated by O, i.e.,^^ 

<g^0 if and only if = (4.10) 

We may now proceed as in Section 3 and analyze on a formal level the association ()4.10|) by 
introducing the notions of weak and strong formal experiment as the obvious generalizations of 
(D and dSSI): 

Any positive- operator-valued measure O defines the weak formal experiment S = O. 

Any set {\a} of not necessarily distinct real numbers (or vectors of real numbers) 

paired with any collection {Ra} of bounded operators on Ti such that Y^R^Ra = I (4.11) 

defines the strong formal experiment S = {Xa,Ra} with associated POVM (|/^. y| ) 

and state transformations ^.yj ). 

The notion of formal experiment is a genuine extension of that of formal measurement, the 
latter being the special case in which O is a PVM and R^Ra are the projections. 

Formal experiments share with formal measurements many features. This is so because all 
measure-theoretic properties of projection-valued measures extend to positive-operator-valued 
measures. For example, just as for PVMs, integration of real functions against positive-operator- 
valued measure is a meaningful operation that generates self-adjoint operators: for given real 
(and measurable) function /, the operator B = J f{X)0{dX) is a self-adjoint operator defined, 
say, by its matrix elements (0, Bip) = J Xfi^ .^{dX) for all (p and ip in TC, where /i^^^ is the complex 
measure p^^^{dX) = (0, 0{dX)ip). (We ignore the difficulties that might arise if / is not bounded.) 
In particular, with O is associated the self-adjoint operator 

Ao = j XO{dX). (4.12) 

It is however important to observe that this association (unlike the case of PVMs, for which 
the spectral theorem provides the inverse) is not invertible, since the self-adjoint operator Aq is 
always associated with the PVM provided by the spectral theorem. Thus, unlike PVMs, POVMs 
are not equivalent to self-adjoint operators. In general, the operator Aq will carry information 
only about the mean value of the statistics of the results, 

j X{ij,0{dX)ij) = {ij,Aoi^), 
while for the higher moments we should expect that 

I A" {^|J,0{dX)^|J)^{^,A-o^) 

Whenever H4.10|l is satisfied we may say that the experiment S" is a. measurement of the generalized observable 
O. We shall however avoid this terminology in connection with generalized observables; even when it is standard 
(so that we use it), i.e., when O is a PVM and thus equivalent to a self-adjoint operator, it is in fact improper. 
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unless O is a PVM. 

What we have just described is an important difference between general formal experiments 
and formal measurements. This and other differences originate from the fact that a POVM is a 
much weaker notion than a PVM. For example, a POVM O on — like ordinary measures and 
unlike PVMs — need not be a product measure: If Oi, . . . , Om are the marginals of O, 

Oi(Ai) = 0(Ai X , . . . , O^(A^) = 0(M™-i x A^), 

the product POVM Oi x ■ • ■ x Om will be in general different from O. (This is trivial since any 
probability measure on R*" times the identity is a POVM.) 

Another important difference between the notion of POVM and that of PVM is this: while the 
projections -P(A) of a PVM, for different A's, commute, the operators 0(A) of a generic POVM 
need not commute. An illustration of how this may naturally arise is provided by sequential 
measurements. 

A sequential measurement (see Section EIHI) M-n ® ■ ■ ■ ® M.i is indeed a very simple example 
of a formal experiment that in general is not a formal measurement (see also Davies ^21J). We 
have that 

Mn® ■ ■ - ^ Mi = {\a,Ra} 

where 




and 

Re = R^:lUt^-t^_MZ~-^ ■ ■ ■ RS -Ut-. 

Note that since Pa = ll-RaV'IP) we have that 

R*aRa = I 

a 

, which also follows directly using 

E^S^*< = ^' 3 = 1,. ..,n 

Now, with Mn ® . . . ® A^i is associated the POVM 

0(A) = J2 KRa. 

Note that 0(A) and O(A') in general don't commute since in general Ra and Rp may fail to do 
so. 

An interesting class of POVMs for which 0(A) and O(A') do commute arises in association 
with the notion of an ^^approximate measurement" of a self-adjoint operator: suppose that the 
result Z of a measurement Ai = of a self-adjoint operator A is distorted by the addition of 
an independent noise with symmetric probability distribution //(A). Then the result Z + A^ of 
the experiment, for initial system wave function ip, is distributed according to 

[ [ r]i\-\'){^,PA{dX')ij)dX, 
J A 7k 
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which can be rewritten as 

A H-> /" r]{\-A)d\ i)) . 

J A 

Thus the result Z + N is governed by the POVM 

0(A) = [ r]{X-A)dX. (4.13) 

J A 

The formal experiment defined by this POVM can be regarded as providing an approximate 
measurement of A. For example, let 

v{X) = ^j=e-^. (4.14) 



Then for a — the POVM (j4.13p becomes the PVM of A and the experiment becomes a 
measurement of A. 

Concerning the POVM ()4.13|) we wish to make two remarks. The first is that the 0(A)'s 
commute since they are all functions of A. The second is that this POVM has a continuous 
density, i.e., 

0{dX) = o(A) d\ where o(A) = ri{\ - A) . 

This is another difference between POVMs and PVMs: like ordinary measures and unlike PVMs, 
POVMs may have a continuous density. The reason this is possible for POVMs is that, for a 
POVM O, unlike for a PVM, given e H, the vectors 0(A)V^ and 0{A')iIj, for A and A' disjoint 
and arbitrarily small, need not be orthogonal. Otherwise, no density o{dX) could exist, because 
this would imply that there is a continuous family {o{\)ijj} of orthogonal vectors in Ti. 

Finally, we observe that unlike strong measurements, the notion of strong formal experiment 
can be extended to POVM with continuous spectrum (see Section One may in fact define 
a strong experiment hj S = {A,-Ra}7 where A i-^ i?A is a continuous bounded- operator-valued 
function such that J R^Rx dX = I. Then the statistics for the results of such an experiment is 
governed by the POVM 0{d\) = RlRxdX. For example, let 

Rx = ^{\~A) where ^ (A) ^ ' — 



Then 0{d\) = RlRxdX is the POVM (KWi with r] given by KWi . We observe that the state 
transformations (cf. the definition ()2.6|) of the conditional wave function) 

ij^Rxi^ = — 1— e-^V (4.15) 

can be regarded as arising from a von Neumann interaction with Hamiltonian ()3.12j) (and 7T = 1) 
and ready state of the apparatus 

1 

My) 



Experiments with state transformations ()4.15|1 . for large a, have been considered by Aharonov and 
coworkers (see, e.g., Aharonov, Anandan, and Vaidman P) as providing "weak measurements" of 
operators. (The effect of the measurement on the state of the system is "small" if o is sufficiently 
large). This terminology notwithstanding, it is important to observe that such experiments are 
not measurements of A in the sense we have discussed here. They give information about the 
average value of A, since J A {iIj , R^Rxip) d\ = {ip,Aip), but presumably none about its higher 
moments. 
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4.3 From Formal Experiments to Experiments 

Just as with a formal measurement (see Section FH.^j) . with a formal experiment £ = {Xa,Ra}, 
we may associate a discrete experiment (o. The unitary map ()2.18|) of will be given again by 
(nmH) . i.e., 

?7 : 7/; ® $0 ^ ® $„, (4.16) 

a 

but now R^Ra of course need not be projection. The unitarity of U follows immediately from 
the orthonormality of the using ^ R^Ra = I ■ (Note that with a weak formal experiment 
£ = O = {Oa} we may associate many inequivalent discrete experiments, defined by ()4.1fi|l with 
operators Ra = Ua^/CT^, for any choice of unitary operators Ua-) 

We shall now discuss a concrete example of a discrete experiment defined by a formal exper- 
iment which will allow us to make some more further comments on the issue of reproducibility 
discussed in Section 

Let {. . . , e_i, Co, ei, . . . } be an orthonormal basis in the system Hilbert space 7i, let P_ , Pq > P+ 
be the orthogonal projections onto the subspaces 7i_, T^o, 'H^ spanned by {e}a<0) {co}, {e}o>o 
respectively, and let y+, V_ be the right and left shift operators. 

Consider the strong formal experiment £ with the two possible results A± = 
state transformations 

i?±i = ^±(P± + ^Po). 
Then the unitary U of the corresponding discrete experiment S is given by 

U : (g) $0 ^ R-i) ® + R+i) ® 

where $o is the ready state of the apparatus and are the apparatus states associated with 
the results ±1. If we now consider the action of U on the basis vectors Cq,, 

[/(e«®$o) 
f/(ea ® $o) 

f/(eo ® $o) 

we see immediately that 

Thus ()2.24|) is satisfied and £' is a, reproducible experiment. Note however that the POVM 
O = {0_i,0+i} associated with ()4.17|) . 

0±i = R*j_^R±i = P± + -Pq , 

is not a PVM since the positive operators 0±i are not projections, i.e, O^.^ ^ 0±\. Thus ^ 
is not a measurement of any self-adjoint operator, which shows that without the assumption of 
the finite dimensionality of the subspaces TYq a reproducible discrete experiment need not be a 
measurement of a self-adjoint operator. 



±1 and associated 
(4.17) 



= 4f(ei®$+ + e_i 



for a > 
for a < 
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4.4 Measure- Valued Quadratic Maps 



We conclude this section with a remark about POVMs. Via ()4.H1 every POVM O defines a 
"normahzed quadratic map" from Ti to measures on some space (the value-space for the POVM). 
Moreover, every such map comes from a POVM in this way. Thus the two notions are equivalent: 



defines a canonical one-to-one correspondence between POVMs and normal- 
ized measure-valued quadratic maps on Ti. 

To say that a measure- valued map on Ti 

(4.19) 

is quadratic means that 

/i^ = 5(^,^) (4.20) 

is the diagonal part of a sesquilinear map B, from 7i x 7i to the complex measures on some 
value space A. If B{%1),%1)) is a probability measure whenever = 1, we say that the map is 
normalized. 

Proposition ()4.18|) is a consequences of the following considerations: For a given POVM O the 
map ip ^ /^^; where /U^(A) = {ip,0{A)ip), is manifestly quadratic, with B{(j),ip) = {(j),0{-)ip), 
and it is obviously normalized. Conversely, let i— /i^ be a normalized measure- valued quadratic 
map, corresponding to some B, and write BA{(j),ip) = B{(f),ip)[A] for the complex measure B at 
the Borel set A. By the Schwartz inequality, applied to the positive form Ba{4>, ip)-, "we have that 
|Sa(0, V')! ^ II'^IIII0II- Thus, using Riesz's lemma jTU], there is a unique bounded operator 0(A) 
on Ti such that 

5a(0,^) = (0,O(A)V'). 

Moreover, 0(A), like -Ba, is countably additive in A, and since B{ip^ip) is a (positive) measure, 
O is a positive-operator-valued measure, normalized because B is. 
A simple example of a normalized measure-valued quadratic map is 

^ ^ p^{dq) = \m\^dq, (4.21) 

whose associated POVM is the PVM P*^ for the position (configuration) operator 

Q^>{q) = q^>{q). (4.22) 

Note also that if the quadratic map //^ corresponds to the POVM O, then, for any unitary 
C/, the composite map ip i— >• /i^^^ corresponds to the POVM U*OU, since {Uip,0{A)U4') = 
U*0{A.)Uip) . In particular for the map ()4.21|) and U = Ut, the composite map corresponds 
to the PVM P*^^, with Qt = U*QU, the Heisenberg position (configuration) at time T, since 
U^P^Ut = P^tQUt^ 



sesquilinear map B{(f>,il)) is one that is linear in the second slot and conjugate linear in the first: 



Clearly any such normalized B can be chosen to be conjugate symmetric, B{'ip,(f)) — B((f>,'ip), without affecting 
its diagonal, and it follows from polarization that any such B must in fact be conjugate symmetric. 
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5 The General Emergence of Operators 



For Bohmian mechanics POVMs emerge naturally, not for discrete experiments, but for a general 
experiment ()2.34|) . To see how this comes about consider the probability measure ()2.35|) giving 
the probability distribution of the result Z = F{Qx) of the experiment, where Qt is the final 
configuration of system and apparatus and F is the calibration function expressing the numerical 
result, for example the orientation B of a pointer. Then the map 

^/J ^ = P^r o F-\ (5.1) 

from the initial wave function of the system to the probability distribution of the result, is 
quadratic since it arises from the sequence of maps 

^ ^ = ^ (g) $0 t-^ = ?7(^ O $o) p^^{dq) = "^^-^Tdq ^^ = p^j, o (5.2) 

where the middle map, to the quantum equilibrium distribution, is obviously quadratic, while 
all the other maps are linear, all but the second trivially so. Now, by ()4.18|) . the notion of 
such a quadratic map ()5.H1 is completely equivalent to that of a POVM on the system Hilbert 
space 7i. (The sesquilinear map B associated with ()5.2|1 is B{ilJi,ip2) = '^lx'^2Td(l ° F~^, where 

Thus the emergence and role of POVMs as generalized observables in Bohmian mechanics is 
merely an expression of the sesquilinearity of quantum equilibrium together with the linearity 
of the Schrodinger evolution. Thus the fact that with every experiment is associated a POVM, 
which forms a compact expression of the statistics for the possible results, is a near mathematical 
triviality. It is therefore rather dubious that the occurrence of POVMs — the simplest case of which 
is that of PVMs — as observables can be regarded as suggesting any deep truths about reality or 
about epistemology. 

An explicit formula for the POVM defined by the quadratic map (j^.lj) follows immediately 
from (|^ : 

4(c/A) = {ij®<i>o,U*P^{F-\d\))U^(»<i>o) = {ij^<i>o,PoU*P^{F-\dX))UPoij®<^o) 

where P^ is the PVM for the position (configuration) operator (|4.22p and Pq is the projection 
onto ?i ® $0, whence 

OidX) = liJPo U*P'^iF-\dX))UPoU, , (5.3) 

where 1<s>qiP = ® $o is the natural identification of Ti with Ti ® $o- This is the obvious POVM 
reflecting the essential structure of the experiment. 

^^This POVM can also be written as 

0(dX) ^tTA \PQU*P'^{F-\dX))u] , (5.4) 

where tr a is the partial trace over the apparatus variables. The partial trace is a map ti a ■ W t—^ tryi(W^), 
from trace class operators on the Hilbert space Tis (?) T^a to trace class operators on Hs, uniquely defined by 
tr s(tr a(M^)-B) = tr s+a{WB ® /), where tr s+A and trg are the usual (scalar- valued) traces of operators on 
Tis (8) Ti-A and Hs, respectively. For a trace class operator B on L'^{dx) (g) L?{dy) with kernel B{x,y,x' ,y') we 
have trA{B) {x,x') = J B{x,y,x' ,y)dy. In 1)5. 4|l tr^ is applied to operators that need not be trace class — nor 
need the operator on the left be trace class — since, e.g., 0(A) = /. The formula nonetheless makes sense. 
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Note that the POVM ()5.3|) is unitarily equivalent to 

PoP^'^^^\dX)Po (5.5) 

where Qt is the Heisenberg configuration of system and apparatus at time T. This POVM, acting 
on the subspace TC ® $0) is the projection to that subspace of a PVM, the spectral projections 
for F{Qt)- Naimark has shown (see, e.g., [25) that every POVM is equivalent to one that arises 
in this way, as the orthogonal projection of a PVM to a subspace. 

We shall now illustrate the association of POVMs with experiments by considering some 
special cases of fl5.2|) . 

5.1 "No Interaction" Experiments 

Let f/ = f/^ (S) f/yi in ()5.2j) (hereafter the indices "S"' and "A" shall refer, respectively, to system 
and apparatus). Then for F{x, y) = y the measure- valued quadratic map defined by ()5.2|1 is 

ip ^ c{y)\\i)\\^dy 

where c(j/) = |[/a$oP(2/)) with POVM Oi{dy) = c{y)dy Is, while for F{q) = q = {x,y) the map 
is 

ijj c{y) \Usij\'^ix)dq 

with corresponding POVM 02{dq) = c{y) UgP^{dx)Usdy. Neither d nor O2 is a PVM. How- 
ever, if F is independent of y, F{x,y) = F{x), then the apparatus can be ignored in ()5.2|) or 
and O = U*sP^Us o F-\ i.e., 

0{dX) = U*sP^{F-\dX))Us, 

which is manifestly a PVM — in fact corresponding to F{Xt), where Xt is the Heisenberg con- 
figuration of the system at the end of the experiment. 

This case is somewhat degenerate: with no interaction between system and apparatus it 
hardly seems anything like a measurement. However, it does illustrate that it is "true" POVMs 
(i.e., those that aren't PVMs) that typically get associated with experiments — i.e., unless some 
special conditions hold (here that F = F{x)). 

5.2 "No X" Experiments 

The map ()5.2j) is well defined even when the system (the a;-system) has no translational degrees of 
freedom, so that there is no x (or X). This will be the case, for example, when the system Hilbert 
space Tis corresponds to the spin degrees of freedom. Then Tis = C" is finite dimensional. 

^Hi 0{dX) is a POVM on E acting on Ti., then the Hilbert space on which the corresponding PVM acts is 
the natural Hilbert space associated with the data at hand, namely L^{'E,,'H,0{dX)), the space of 7i-valued 
functions on S, with inner product given by / {ip{X) , 0{dX)(t){X)) . (If this is not, in fact, positive definite, 
then the quotient with its kernel should be taken — '0(A) should, in other words, be understood as the appropriate 
equivalence class.) Then 0{dX) is equivalent to PE{dX)P, where E{A) = 1a(A), multiphcation by 1a(A), and P 
is the orthogonal projection onto the subspace of constant 7i-valued functions ijj{X) = -ip- 
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In such cases, the cahbration F of course is a function of y alone, since there is no x. For 
F = y the measure-valued quadratic map defined by (|5.2|) is 

ij^\[U{ij^<^o)]{y)\'dy, (5.6) 

where | • ■ ■ | denotes the norm in C". 

This case is physically more interesting than the previous one, though it might appear rather 
puzzling since until now our measured systems have always involved configurations. After all, 
without configurations there is no Bohmian mechanics! However, what is relevant from a Bohmian 
perspective is that the composite of system and apparatus be governed by Bohmian mechanics, 
and this may well be the case if the apparatus has configurational degrees of freedom, even if 
what is called the system doesn't. Moreover, this case provides the prototype of many real- world 
experiments, e.g., spin measurements. 

For the measurement of a spin component of a spin-1/2 particle — recall the description of the 
Stern-Gerlach experiment given in SectionlTHl— we let Hs = the spin space, with "apparatus" 
configuration ?/ = x, the position of the particle, and with suitable calibration F{x). (For a real 
world experiment there would also have to be a genuine apparatus — a detector — that measures 
where the particle actually is at the end of the experiment, but this would not in any way affect 
our analysis. We shall elaborate upon this below.) The unitary U of the experiment is the 
evolution operator up to time T generated by the Pauli Hamiltonian ()2.12|) . which under the 
assumption ()2.14|) becomes 

H = -^V'-{b + az)a, (5.7) 
2m 

Moreover, as in Section 12. 5[ we shall assume that the initial particle wave function has the 
form $o(x) = ^o{z)4>{x,y).'^^ Then for -F(x) = z the quadratic map ()5.2j) is 

^ ^ (i(^+,^)n4+)(^)p+i(v^-,v^)n4-)(^)p)rf^ 

|^+)(^+||$it)(z)|2 + |^-)(^-||$(r)(z)|2^^ dz 



with POVM 



|4"^(z^i2 



where ijj^ are the eigenvectors ()2.13|) of and are the solutions of ()2.15p computed at t = T, 
for initial conditions $0'^^'' = ^oi^)- 

Consider now the appropriate calibration for the Stern-Gerlach experiment, namely the func- 
tion 

f(x)4"; (5.9) 
1—1 n z < 

which assigns to the outcomes of the experiment the desired numerical results: if the particle 
goes up in the z- direction the spin is while if the particle goes down the spin is -1. The 
corresponding POVM Ot is defined by 



^^We abuse notation here in using the notation y — :x. — {x,y,z). The y on the right should of course not be 
confused with the one on the left. 
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where 

/•oo /-oo 
p+= p-= \^T^-^\\z)dz. 

Jo Jo 
It should be noted that Ot is not a PVM. However, as indicated in Section as T ^ oo, 
— > 1 and —>■ 0, and the POVM Ot becomes the PVM of the operator a^, i.e., Ot —>■ P"" 
defined by 

. f 1 M . f M (5.10) 



^ y ' vol 

and the experiment becomes a measurement of the operator a 



5.3 "No y Experiments 

Suppose now that the "apparatus" involves no translational degrees of freedom, i.e., that there is 
no y (or Y). For example, suppose the apparatus Hilbert space TIa corresponds to certain spin 
degrees of freedom, with Ti^ = C" finite dimensional. Then, of course, F = F{x). 

This case illustrates what measurements are not. If the apparatus has no configurational 
degrees of freedom, then neither in Bohmian mechanics nor in orthodox quantum mechanics is 
it a bona fide apparatus: Whatever virtues such an apparatus might otherwise have, it certainly 
can't generate any directly observable results (at least not when the system itself is microscopic). 
According to Bohr ( JTj, pages 73 and 90): "Every atomic phenomenon is closed in the sense 
that its observation is based on registrations obtained by means of suitable amplification devices 
with irreversible functioning such as, for example, permanent marks on the photographic plate" 
and "the quantum-mechanical formalism permits well-defined applications only to such closed 
phenomena." To stress this point, discussing particle detection Bell has said 0: "Let us suppose 
that a discharged counter pops up a fiag sayings 'Yes' just to emphasize that it is a macroscopically 
different thing from an undischarged counter, in a very different region of configuration space." 

Experiments based on certain micro-apparatuses, e.g., "one-bit detectors" [ZHI, provide a nice 
example of "No Y" experiments. We may think of a one-bit detector as a spin-l/2-like system 
(e.g., a two-level atom), with "down" state $o (the ready state) and "up" state $i and which is 
such that its configurational degrees of freedom can be ignored. Suppose that this "spin-system," 
in its "down" state, is placed in a small spatial region Ai and consider a particle whose wave 
function has been prepared in such a way that at t = it has the form ip = ipi + ip2i where ipi is 
supported by Ai and ip2 by A2 disjoint from Ai. Assume that the particle interacts locally with 
the spin-system, in the sense that were = ipi the "spin" would fiip to the "up" state, while were 
'0 = V'2 it would remain in its "down" state, and that the interaction time is negligibly small, 
so that other contributions to the Hamiltonian can be ignored. Then the initial state t/^ ® $0 
undergoes the unitary transformation 

U : $0^^ = i)i®^i + %l)2®^o- (5.11) 

We may now ask whether U defines an experiment genuinely measuring whether the particle is 
in Ai or A2. The answer of course is no (since in this experiment there is no apparatus property at 
all with which the position of the particle could be correlated) unless the experiment is (quickly) 
completed by a measurement of the "spin" by means of another (macroscopic) apparatus. In 
other words, we may conclude that the particle is in Ai only if the spin-system in effect pops up 
a flag saying "up" . 
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5.4 "No Y no Experiments 



Suppose there is no apparatus at all: no apparatus configuration y nor Hilbert space Ha, or, 
what amounts to the same thing, Ha = C For calibration F = x the measure-valued quadratic 
map defined by ()5.2|) is 

ijj ^ \U^ix)\\ 

with POVM U*P^U, while the POVM for general calibration F{x) is 

0{dX) = U*P^{F'\dX))U . (5.12) 

O is a PVM, as mentioned in Section corresponding to the operator U*F{X)U = F{Xt), 
where Xt is the Heisenberg position (configuration) operator at time T. 

It is important to observe that even though these experiments suffer from the defect that no 
correlation is established between the system and an apparatus, this can easily be remedied — by 
adding a final detection measurement that measures the final actual configuration X^ — without 
in any way affecting the essential formal structure of the experiment. For these experiments 
the apparatus thus does not introduce any additional randomness, but merely reflects what was 
already present in Xt- All randomness in the flnal result 

Z = F{Xt) (5.13) 

arises from randomness in the initial conflguration of the system. 

For F = X and U = I the quadratic map is ^ \ip{x)\'^ with PVM , so that this (trivial) 
experiment corresponds to the simplest and most basic operator of quantum mechanics: the 
position operator. How other basic operators arise from experiments is what we are going to 
discuss next. 



5.5 The Basic Operators of Quantum Mechanics 

According to Bohmian mechanics, a particle whose wave function is real (up to a global phase), 
for example an electron in the ground state of an atom, has vanishing velocity, even though the 
quantum formalism assigns a nontrivial probability distribution to its momentum. It might thus 
seem that we are faced with a conflict between the predictions of Bohmian mechanics and those 
of the quantum formalism. This, however, is not so. The quantum predictions about momentum 
concern the results of an experiment that happens to be called a momentum measurement and 
a conflict with Bohmian mechanics with regard to momentum must reflect disagreement about 
the results of such an experiment. 

One may base such an experiment on free motion followed by a flnal measurement of posi- 
tion. Consider a particle of mass m whose wave function at t = is ip = ipi'x). Suppose no 

^^Though passive, the apparatus here plays an important role in recording the final configuration of the system. 
However, for experiments involving detections at different times, the apparatus plays an active role: Consider such 
an experiment, with detections at times ti, . . . , i„, and final result Z — F{Xt-^ , ■ ■ ■ , ^t„)- Though the apparatus 
introduces no extra randomness, it plays an essential role by changing the wave function of the system at the 
times ti, . . . ,tn and thus changing the evolution of its configuration. These changes are reflected in the POVM 
structure that governs the statistical distribution of Z for such experiments (see Section m^l . 

^''The emergence of the momentum operator in such so-called time-of-flight measurements was discussed by 
Bohm in his 1952 article |15|. A similar derivation of the momentum operator can be found in Feynman and 
Hibbs El. 
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forces are present, that is, that all the potentials acting on the particle are turned off, and let the 
particle evolve freely. Then we measure the position X^^ that it has reached at the time t = T. 
It is natural to regard = ^t/T and Pt = mX^T/T as providing, for large T, approximations 
to the asymptotic velocity and momentum of the particle. It turns out that the probability 
distribution of Pt, in the limit T — > oo, is exactly what quantum mechanics prescribes for the 
momentum, namely |?/'(p)p, where 



is the Fourier transform of ip. 

This result can be easily understood: Observe that IV^tIx)]^ c/x, the probability distribution 

of Xt, is the spectral measure /i^^((ix) = (■?/', P-^^((ix) ■?/') of X^ = U^'X.Ut-, the (Heisenberg) 

position operator at time t = T; here Ut is the free evolution operator and X is, as usual, the 
position operator at time t = 0. By elementary quantum mechanics (specifically, the Heisenberg 
equations of motion), X^ = ;^P^ + X, where P = — i^V is the momentum operator. Thus 
as T oo the operator mX^/T converges to the momentum operator P, since X/T is 0(1/T), 
and the distribution of the random variable Pt accordingly converges to the spectral measure of 
P, given by |V'(p)p.^^ 

The momentum operator arises from a. {T —>■ oo) limit of "no Y no $" single-particle exper- 
iments, each experiment being defined by the unitary operator Ut (the free particle evolution 
operator up to time T) and calibration -Ft(x) = myi/T. Other standard quantum-mechanical 
operators emerge in a similar manner, i.e., from a T ^ oo limit of appropriate single-particle 
experiments. 

This is the case, for example, for the spin operator a^- As in Section consider the evolu- 
tion operator Ut generated by Hamiltonian ()5.7p . but instead of ()5.9p . consider the calibration 
Ft{x.) = 2m z/ aT"^. This calibration is suggested by ()2.16p . as well as by the explicit form of 
the z-component of the position operator at time t = T, 

ZT = U-^ZUT = Z + —T+^a,T\ (5.14) 

m 2m 



which follows from the Heisenberg equations 

d^t dZ, 



m- 



dt^ dt 



Pz = -ih-^ , Zo 
oz 



t=0 

This formal argument can be turned into a rigorous proof by considering the hmit of the characteristic 
function of Pt, namely of the function /t(A) = / e^^'^ pridp), where pT is the distribution of ttiXt/T: frW — 

(^ip, exp ^iX ■ mX-T/Tj ip^, and using the dominated convergence theorem |70| this converges as T cxd to 

/(A) = (^ip^exp (^i X- implying the desired result. The same result can also be obtained using the well 

known asymptotic formula (see, e.g., |69p for the solution of the free Schrodinger equation with initial condition 

2 jT^x , 



/m\2 r/»7ix 



for T ^ oo. 
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Then, for initial state \1/ = ■?/' ® $o with suitable $0) where if) = atp^'^'^ + Pip^ \ the distribution 
of the random variable 

2 m Zt 
^zT = Ft[Xt) = — 
a I 

converges as T ^ oo to the spectral measure of az, with values +1 and —1 occurring with 
probabilities |ap and respectively.^^ This is so, just as with the momentum, because as 
T — >• cxD the operator converges to cx^. 

We remark that we've made use above of the fact that simple algebraic manipulations on the 
level of random variables correspond automatically to the same manipulations for the associated 
operators. More precisely, suppose that 

Z^A (5.15) 

in the sense (of fl2.37|) ) that the distribution of the random variable Z is given by the spectral 
measure for the self-adjoint operator A. Then it follows from ()3.17p that 

f{Z) - f{A) (5.16) 

for any (Borel) function /. For example, since ^ X^, m^T/T ^ m%T/T, and since Z^ — > 
Zt, ^ ^f^. Similarly, if a random variable P ^ P, then PV(2m) ^ Ho = P^/{2m). 

This is rather trivial, but it is not as trivial as the failure even to distinguish Z and Z would 
make it seem. 



5.6 From Positive-Operator- Valued Measures to Experiments 

We wish here to point out that to a very considerable extent the association ^ 0{dX) of 
experiments with POVMs is onto. It is more or less the case that every POVM arises from an 
experiment. 

We have in mind two distinct remarks. First of all, it was pointed out in the first paragraph 
of Section 4.3 that every discrete POVM Oa (weak formal experiment) arises from some discrete 
experiment ^. Thus, for every POVM 0{dX) there is a sequence (f of discrete experiments for 
which the corresponding POVMs O*^"-' converge to O. 

The second point we wish to make is that to the extent that every PVM arises from an 
experiment S' = {$05^)-^}; so too does every POVM. This is based on the fact, mentioned at 
the end of the introduction to Section 5, that every POVM 0{dX) can be regarded as arising 
from the projection of a PVM E[dX), acting on n^^\ onto the subspace H C n^^\ We may 
assume without loss of generality that both Ti and 7i*^^^ 7i are infinite dimensional (by some 
otherwise irrelevant enlargements if necessary). Thus we can identify Ti*-^^ with H '^^apparatus(i) 
and the subspace with Ti ® ^'o^'', for any choice of $o^^ Suppose now that there is an experiment 
^(1) = {^^^\U,F} that measures the PVM E (i.e., that measures the observable A = J XE{dX)) 

where $0 ^ ^apparatus(2) ) U acts OnH® T^apparatus whcrC TYapparatus = '^apparatus(l) ® ^apparatus(2) 

and F is a function of the configuration of the composite of the 3 systems: system, apparatus^^-* 
and apparatus^^). Then, with $0 = $0^^ ® ^o \ ^ = {*o, U, F} is associated with the POVM O. 

^^For the Hamiltonian H5.7|l no assumption on the initial state ^' is required here; however (|5.7I) wiU be a 
reasonably good approximation only when 4" has a suitable form, expressing in particular that the particle is 
appropriately moving towards the magnet. 
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5.7 Invariance Under Trivial Extension 

Suppose we change an experiment S to S' by regarding its x-system as containing more of the 
universe that the x-system for ^, without in any way ahering what is physically done in the 
experiment and how the result is specified. One would imagine that would be equivalent to S . 
This would, in fact, be trivially the case classically, as it would if S were a genuine measurement, 
in which case S' would obviously measure the same thing as S . This remains true for the more 
formal notion of measurement under consideration here. The only source of nontriviality in 
arriving at this conclusion is the fact that with we have to deal with a different, larger class 
of initial wave functions. 

We will say that is a trivial extension of S if the only relevant difference between S and 
S' is that the x-system for S' has generic configuration x' = {x,x), whereas the x-system for 
has generic configuration x. In particular, the unitary operator U' associated with ^' has the 
form U' = U ^U, where U is the unitary associated with (f, implementing the interaction of the 
x-system and the apparatus, while f/ is a unitary operator describing the independent evolution 
of the x-system, and the calibration F for (f' is the same as for S'. (Thus F does not depend 
upon X.) 

The association of experiments with (generalized) observables (POVMs) is invariant under 
trivial extension: if i-^ O in the sense of (j4.1(jp and <f' is a trivial extension of (o, then 
^ O ® 1 1 where I is the identity on the Hilbert space of the x-system. 

To see this note that if ^ i— O then the sesquilinear map B arising from ()5.2j) for S' is of the 
form 

on product wave functions ip' = ip <^ iJj, which easily follows from the form of U' and the fact 
that F doesn't depend upon x, so that the x-degrees of freedom can be integrated out. Thus the 
POVM O' for agrees with O / on product wave functions, and since such wave functions 
span the Hilbert space for the (x, x)-system, we have that O' = O ® F Thus S' ^ O ® F 

In other words, if 1^ is a measurement of O, then is a measurement oi 0®F In particular, 
if is a measurement the self-adjoint operator A, then S' is a measurement oi A®F This result 
is not quite so trivial as it would be were it concerned with genuine measurements, rather than 
with the more formal notion under consideration here. 

Now suppose that (S' is a trivial extension of a discrete experiment with state transforma- 
tions given by Ra- Then the state transformations for S" are given by R'^ = R^ ® U . This is so 
because R'^ must agree with Ra^U on product wave functions tfj' = tfj ® ip , and these span the 
Hilbert space of the (x, x)-system. 

5.8 POVMs and the Positions of Photons and Dirac Electrons 

We have indicated how POVMs emerge naturally in association with Bohmian experiments. 
We wish here to indicate a somewhat different role for a POVM: to describe the probability 
distribution of the actual (as opposed to measured^°) position. The probability distribution of 
the position of a Dirac electron in the state ip is 'ip^'^p- This is given by a PVM E{dx.) on the 
one-particle Hilbert space Ti. spanned by positive and negative energy electron wave functions. 
However the physical one-particle Hilbert-space Ti.^ consists solely of positive energy states, and 

■^"^The accurate measurement of the position of a Dirac electron is presumably impossible. 
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this is not invariant under the projections E. Nonetheless the probabihty distribution of the 
position of the electron is given by the POVM P^E{dx.)P^ acting on Ti^, where P+ is the 
orthogonal projection onto Similarly, constraints on the photon wave function require the 
use of POVMs for the localization of photons [5^ IH].^^ 

6 Density Matrices 

The notion of a density matrix, a positive (trace class) operator with unit trace on the Hilbert 
space of a system, is often regarded as providing the most general characterization of a quantum 
state of that system. According to the quantum formalism, when a system is described by the 
density matrix W, the expected value of an observable A is given by tr (VTA). If A has PVM O, 
and more generally for any POVM O, the probability that the (generalized) observable O has 
value in A is given by 

Prob(0 G A) = tr(iyO(A)). (6.1) 

A density matrix that is a one-dimensional projection, i.e., of the form where is a. unit 

vector in the Hilbert space of the system, describes a pure state (namely, ip), and a general 
density matrix can be decomposed into a mixture of pure states ipk, 

W = ^Pk\i'k){i'k\ where ^pk = l. (6.2) 

k k 

Naively, one might regard pk as the probability that the system is in the state ipk- This 
interpretation is, however, untenable, for a variety of reasons. First of all, the decomposition 
()6.2|) is not unique. A density matrix W that does not describe a pure state can be decomposed 
into pure states in a variety of different ways. 

It is always possible to decompose a density matrix W in such a way that its components ipk 
are orthonormal. Such a decomposition will be unique except when W is degenerate, i.e., when 
some pkS coincide. For example, if pi = p2 we may replace ipi and ^/'2 by any other orthonormal 
pair of vectors in the subspace spanned by ipi and iIj2- And even if W were nondegenerate, it 
need not be the case that the system is in one of the states ipk with probability pk, because for 
any decomposition (j6.2j) . regardless of whether the ipk are orthogonal, if the wave function of the 
system were ipk with probability p^, this situation would be described by the density matrix W. 

Thus a general density matrix carries no information — not even statistical information — about 
the actual wave function of the system. Moreover, a density matrix can describe a system that 
has no wave function at all! This happens when the system is a subsystem of a larger system 
whose wave function is entangled, i.e., does not properly factorize (in this case one usually speaks 
of the reduced density matrix of the subsystem). 

This impossibility of interpreting density matrices as real mixtures of pure states has been 
regarded by many authors (e.g., von Neumann [Tl] and Landau [Sn]) as a further indication that 
quantum randomness is inexplicable within the realm of classical logic and probability. However, 
from the point of view of Bohmian mechanics, there is nothing mysterious about density matrices. 
Indeed, their role and status within the quantum formalism can be understood very easily in terms 

^^For example, on the one-photon level, both the proposal 4* = E + iB (where E and B are the electric and 
the magnetic free fields) J^li and the proposal '4' = A (where A is the vector potential in the Coulomb gauge) 
P], require the constraint V • 'i' = 0. 
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of the general framework of experiments of Sectional (It can, we believe, be reasonably argued 
that even from the perspective of orthodox quantum theory, density matrices can be understood 
in a straightforward way.) 



6.1 Density Matrices and Bohmian Experiments 

Consider a general experiment S' ^ O (see equation ()4.1()|l ) and suppose that the initial wave 
function of the system is random with probability distribution p{dip) (on the set of unit vectors 
in Ti). Then nothing will change in the general argument of Section El except that now in 
()4.10|) and ()5.2|) should be interpreted as the conditional probability given ip. It follows then from 
()6.1|) . using the fact that {ip, 0{A)ip) = tr {\ip){ip\ 0(A)), that the probability that the result of 
S' lies in A is given by 

p(#) {tp, 0{A)^) = \^){^\ C(A)^ = tr {WO{A)) (6.3) 

where^^ 

W=[ p{d^) (6.4) 



is the ensemble density matrix arising from a random wave function with (ensemble) distribu- 
tion p. 

Now suppose that instead of having a random wave function, our system has no wave function 
at all because it is entangled with another system. Then there is still an object that can naturally 
be regarded as the state of our system, an object associated with the system itself in terms of 
which the results of experiments performed on our system can be simply expressed. This object 
is a density matrix W and the results are governed by ()6.H1 . W is the reduced density matrix 
arising from the state of the larger system. This is more or less an immediate consequence of 
invariance under trivial extension, described in Section IF!7I 

Consider a trivial extension of an experiment ^-^ O on our system — precisely what we 
must consider if the larger system has a wave function ip' while our (smaller) system does not. 
The probability that the result of S"' lies in A is given by 

{ip', 0(A) ® IiP') = tr' (|^')(^'l 0{A) ® /) = tr {WO{A)) , (6.5) 

where tr ' is the trace for the x'-system (the big system) and tr is the trace for the x-system. In 
agreement with standard quantum mechanics, the last equality of (j6.5|) defines W as the reduced 
density matrix of the x-system, i.e, 

W = Q (6.6) 
where tr denotes the partial trace over the coordinates of the x-system. 



■^^Note that since p is a probability measure on the unit sphere in 7i, is a positive trace class operator with 
unit trace. 
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6.2 Strong Experiments and Density Matrices 

A strong formal experiment £ = {Xa,Ra} generates state transformations Raip- This 

suggests the following action on an initial state described by a density matrix W: When the 
outcome is a, we have the transformation 

tr (TZaW) tr (RaWR-^) ^ ' ' 

where 

n^w = RoWK ■ (6-8) 

After all, ()6.7p is a density matrix naturally associated with R^ and W , and it agrees with 
■0 — *• RofllJ for a pure state, W = \ip){ip\. In order to show that ()6.7|) is indeed correct, we must 
verify it for the two different ways in which our system might be assigned a density matrix W, 
i.e., for W an ensemble density matrix and for W a reduced density matrix. 

Suppose the initial wave function is random, with distribution p^dip). Then the initial 
state of our system is given by the density matrix ()6.4|) . When the outcome a is obtained, 
two changes must be made in ()6.4j) to reflect this information: must be replaced by 

{Ra\ij) {ip\R*a) IWRai^W^ 1 and p{dtp) must be replaced by p{dip\a), the conditional distribution of 
the initial wave function given that the outcome is a. For the latter we have 

(WRa'ipW'^pidip) is the joint distribution of and a and the denominator is the probability of 
obtaining the outcome a.) Therefore W undergoes the transformation 

w= p{dij)\^){^\ ^ / p{d^\a) .1 lu, = / pW) ° - 



Rai^V J ^' tT{RJ¥Rl) ti{R^WRiy 

We wish to emphasize that this demonstrates in particular the nontrivial fact that the density 
matrix 7^aiy/tr iJZaW) produced by the experiment depends only upon the initial density matrix 
W . Though W can arise in many different ways, corresponding to the multiplicity of different 
probability distributions p{dtp) yielding W via (j6.4j) . insofar as the final state is concerned, these 
differences don't matter. 

This does not, however, establish ()6.7p when W arises not from a random wave function but 
as a reduced density matrix. To deal with this case we consider a trivial extension of a discrete 
experiment ^ with state transformations Ra- Then ^' has state transformations Ra ® U (see 
Section r5.7|l . Thus, when the initial state of the x'-system is tp', the final state of the x-system 
is given by the partial trace 



Q (^R^(E)tJ\ij')mR*^(E)U*) _ tr {R^®I\^'){^'\R*^®I) _ R^G {\^'){^'\)R*^ 



ti' [Ra® U\ip') {ip'\Rl ® t>* 



tr {Ra ^ 


^I\ij'){ij'\R*^(^ 


5/) 


tr ' {Ra C 







RaWRl 



tr {RaWRl 

where the cyclicity of the trace has been used. 
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To sum up, when a strong experiment £ = {A^, Ra} is performed on a system described by 
the initial density matrix W and the outcome a is obtained, the final density matrix is given by 
(j6.7|) : moreover, from the results of the previous section it follows that the outcome a will occur 
with probability 

= tr (WOo,) = tr {WR*M = tr (TZaW) , (6.9) 
where the last equality follows from the cyclicity of the trace. 

6.3 The Notion of Instrument 

We shall briefiy comment on the relationship between the notion of strong formal experiment 
and that of instrument (or effect) discussed by Davies j^I] . 

Consider an experiment £ = {Aq,/?^} on a system with initial density matrix W. Then a 
natural object associated with £ is the set function 

7^(A)w^ = J2 T^c.w = J2 Rc^wK ■ (6.10) 

The set function TZ : ^(^) compactly expresses both the statistics of £ for a general initial 
system density matrix W and the effect of £ on conditioned on the occurrence of the event 
"the result of £ is in A" . 

To see this, note first that it follows from ()6.9|1 that the probability that the result of the 
experiment lies in the set A is given by 

p(A) = tr (7^(A)W^) . 

The conditional distribution p(q;|A) that the outcome is a given that the result A^ G A is then 
{JZaW) /ti {1Z{/\)W). The density matrix that refiects the knowledge that the result is in a, 
obtained by averaging ()6.7|) over A usingp(a|A), is thus TZ{/S.)W /ti {TZ{/S.)W). 

It follows from ()6.10|) that 7^ is a countably additive set function whose values are positive 
preserving linear transformations in the space of trace-class operators in Ti. Any map with these 
properties, not necessarily of the special form (jfj.lOj) . is called an instrument. 

6.4 On the State Description Provided by Density Matrices 

So far we have followed the standard terminology and have spoken of a density matrix as de- 
scribing the state of a physical system. It is important to appreciate, however, that this is merely 
a frequently convenient way of speaking, for Bohmian mechanics as well as for orthodox quan- 
tum theory. Insofar as Bohmian mechanics is concerned, the significance of density matrices is 
neither more nor less than what is implied by their role in the quantum formalism as described 
in Sections 16.11 and 16.21 While many aspects of the notion of (effective) wave function extend 
to density matrices, in particular with respect to weak and strong experiments, density matrices 
lack the dynamical implications of wave functions for the evolution of the configuration, a point 
that has been emphasized by Bell 7J: 

In the de Broglie-Bohm theory a fundamental significance is given to the wave func- 
tion, and it cannot be transferred to the density matrix. ... Of course the density 
matrix retains all its usual practical utility in connection with quantum statistics. 
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That this is so should be reasonably clear, since it is the wave function that determines, in 
Bohmian mechanics, the evolution of the configuration, and the density matrix of a system does 
not determine its wave function, even statistically. To underline the point we shall recall the 
analysis of Bell f^: Consider a particle described by a density matrix Wt evolving autonomously, 
so that Wt = UtWoU^^, where Ut is the unitary group generated by a Schrodinger Hamiltonian. 
Then p^^{x) = Wt{x,x) = {x\Wt\x) gives the probability distribution of the position of the 
particle. Note that satisfies the continuity equation 

^ + div = where ix) = -Im [V^W(x, x')]._^ . 

at m 

This might suggest that the velocity of the particle should be given by f = J^/p^, which 
indeed agrees with the usual formula when is a pure state {W{x,x') = ilj{x)tlj*{x')). How- 
ever, this extension of the usual formula to arbitrary density matrices, though mathemati- 
cally "natural," is not consistent with what Bohmian mechanics prescribes for the evolution 
of the configuration. Consider, for example, the situation in which the wave function of a 
particle is random, either ipi or ip2, with equal probability. Then the density matrix is 
W{x,x') = ^ {■iIji{x)i/jI{x') + iIj2{x)iIj2{^'))- But the velocity of the particle will be always ei- 
ther vi or V2 (according to whether the actual wave function is ipi or 1IJ2), and — unless ipi and ip2 
have disjoint supports — this does not agree with / p^ , an average of vi and V2. 

What we have just said is correct, however, only when spin is ignored. For particles with spin a 
novel kind of density matrix emerges, a conditional density matrix, analogous to the conditional 
wave function (|2.6p and with an analogous dynamical role: Even though no conditional wave 
function need exist for a system entangled with its environment when spin is taken into account, 
a conditional density matrix W always exists, and is such that the velocity of the system is indeed 
given by / p^ . See |31j for details. 

A final remark: the statistical role of density matrices is basically different from that provided 
by statistical ensembles, e.g, by Gibbs states in classical statistical mechanics. This is because, 
as mentioned earlier, even when it describes a random wave function via (j6.4|) . a density matrix 
W does not determine the ensemble p{dip) from which it emerges. The map defined by ()6.4|) 
from probability measures p on the unit sphere in Ti to density matrices W is many-to-one. 
Consider, for example, the density matrix -/ where I is the identity operator on an n-dimensional 
Hilbert space Ti. Then a uniform distribution over the vectors of any given orthonormal basis 
of Ti leads to this density matrix, as well as does the continuous uniform measure on the sphere 
1 1 -01 1 = 1. However, since the statistical distribution of the results of any experiment depends 
on p only through W, different p's associated with the same W are empirically equivalent in the 
sense that they can't be distinguished by experiments performed on a system prepared somehow 
in the state W. 



■^^This is relevant to the foundations of quantum statistical mechanics, for which the state of an isolated 
thermodynamic system is usually described by the microcanonical density matrix Z~^5{H — E), where Z = 
tv5{H — E) is the partition function. Which ensemble of wave functions should be regarded as forming the 
thermodynamic ensemble? A natural choice is the uniform measure on the subspace H = E, which should be 
thought of as fattened in the usual way. Note that this choice is quite distinct from another one that people often 
have in mind: a uniform distribution over a basis of energy eigenstates of the appropriate energy. Depending 
upon the choice made, we obtain different notions of typical equilibrium wave function. 
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7 Genuine Measurements 



We have so far discussed various interactions between a system and an apparatus relevant to the 
quantum measurement formahsm, from the very special ones formalized by "ideal measurements" 
to the general situation described in section 5. It is important to recognize that nowhere in this 
discussion was there any implication that anything was actually being measured. The fact that 
an interaction with an apparatus leads to a pointer orientation that we call the result of the 
experiment or "measurement" in no way implies that this result reflects anything of significance 
concerning the system under investigation, let alone that it reveals some preexisting property of 
the system — and this is what is supposed to be meant by the word measurement. After all [72], 
"any old playing around with an indicating instrument in the vicinity of another body, whereby 
at any old time one then takes a reading, can hardly be called a measurement of this body," 
and the fact the experiment happens to be associated, say, with a self-adjoint operator in the 
manner we have described, so that the experiment is spoken of, in the quantum formalism, as a 
measurement of the corresponding observable, certainly offers little support for using language 
in this way. 

We shall elaborate on this point later on. For now we wish to observe that the very gener- 
ality of our analysis, particularly that of section 5, covering as it does all possible interactions 
between system and apparatus, covers as well those particular situations that in fact are genuine 
measurements. This allows us to make some definite statements about what can be measured in 
Bohmian mechanics. 

For a physical quantity, describing an objective property of a system, to be measurable means 
that it is possible to perform an experiment on the system that measures the quantity, i.e., 
an experiment whose result conveys its value. In Bohmian mechanics a physical quantity ^ is 
expressed by a function 

e = /(^,v^) (7.1) 

of the complete state (X, ip) of the system. An experiment S' measuring ^ is thus one whose 
result Z = F{Xt, It) = Z{X, Y, ^) equals ^ = /(X, ^) = ^(X, V^), 

Z(X,y,vl>)=e(X,^), (7.2) 

where X, Y, ip and \E' refer, as in Section 5, to the initial state of system and apparatus, imme- 
diately prior to the measurement, and where the equality should be regarded as approximate, 
holding to any desired degree of accuracy. 

The most basic quantities are, of course, the state components themselves, namely X and ip, 
as well as the velocities 

Vfc = — Im— — — 7.3 

of the particles. One might also consider quantities describing the future behavior of the system, 
such as the configuration of an isolated system at a later time, or the time of escape of a particle 
from a specified region, or the asymptotic velocity discussed in Section l^TSl (Because the dynamics 
is deterministic, all of these quantities are functions of the initial state of the system and are 
thus of the form ()7.1|) .) 

We wish to make a few remarks about the measurability of these quantities. In particular, 
we wish to mention, as an immediate consequence of the analysis at the beginning of Section 5, 
a condition that must be satisfied by any quantity if it is to be measurable. 
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7.1 A Necessary Condition for Measurability 

Consider any experiment S measuring a physical quantity ^. We showed in Section 5 that the 
statistics of the result Z oi S must be governed by a POVM, i.e., that the probability distribution 
of Z must be given by a measure-valued quadratic map on the system Hilbert space 7i. Thus, 
by (Q, 

^ is measurable only if its probability distribution fi^ is a measure-valued quadratic 
map on Ti. 

As indicated earlier, the position X and the asymptotic velocity or momentum P have distri- 
butions quadratic in ijj, namely /ix('ix) = |'?/'(x)p and /ip((ip) = |'?/'(p)p, respectively. Moreover, 
they are both measurable, basically because suitable local interactions exist to establish appro- 
priate correlations with the relevant macroscopic variables. For example, in a bubble chamber 
a particle following a definite path triggers a chain of reactions that leads to the formation of 
(macroscopic) bubbles along the path. 

The point we wish to make now, however, is simply this: the measurability of these quantities 
is not a consequence of the fact that these quantities obey this measurability condition. We em- 
phasize that this condition is merely a necessary condition for measurability, and not a sufficient 
one. While it does follow that if ^ satisfies this condition there exists a discrete experiment that 
is an approximate formal measurement of ^ (in the sense that the distribution of the result of 
the experiment is approximately /x^), this experiment need not provide a genuine measurement 
of because the interactions required for its implementation need not exist and because, even if 
they did, the result Z of the experiment might not be related to the quantity ^ in the right way, 
i.e, via (Q. 

We now wish to illustrate the use of this condition, first transforming it into a weaker but 
more convenient form. Note that any quadratic map must satisfy 

and thus if ii^ is also positive we have the inequality 

Thus it follows from (j7.4j) that a quantity^^ 



(7.6) 



^ must fail to be measurable if it has a possible value (one with nonvanishing 
probability or probability density) when the wave function of the system is ipi +ip2 
that is neither a possible value when the wave function is ipi nor a possible value 
when the wave function is il)2- 

(Here neither ipi nor 11)2 need be normalized.) 

^^This conclusion is also a more or less direct consequence of the linearity of the Schrodinger evolution: If 
'4^i®^o^ for all z, then '^^i ® ^ ^ ^i- But, again, our purpose here has been mainly to illustrate the 
use of the measurability condition itself. 
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7.2 The Nonmeasurability of Velocity, Wave Function and Determin- 
istic Quantities 

It is an immediate consequence of ()7.6j) that neither the velocity nor the wave function is mea- 
surable, the latter because the value + '?/'2" is neither "t/'i" nor "'?/'2," and the former because 
every wave function ip may be written ip = ipi + ip2 where ipi is the real part of ip and 'ip2 is i 
times the imaginary part of if), for both of which the velocity (of whatever particle) is 0. 

Note that this is a very strong and, in a sense, surprising conclusion, in that it establishes the 
impossibility of measuring what is, after all, a most basic dynamical variable for a deterministic 
mechanical theory of particles in motion. It should probably be regarded as even more surprising 
that the proof that the velocity — or wave function — is not measurable seems to rely almost on 
nothing, in effect just on the linearity of the evolution of the wave function. However, one should 
not overlook the crucial role of quantum equilibrium. 

We observe that the nonmeasurability of the wave function is related to the impossibility of 
copying the wave function. (This question arises sometimes in the form, "Can one clone the wave 
function?" [SnilZHlini!-) Copying would be accomplished, for example, by an interaction leading, 
for all ip, from ® 0o ® to ip^ip^^, but this is clearly incompatible with unitarity. We wish 
here merely to remark that the impossibility of cloning can also be regarded as a consequence of 
the nonmeasurability of the wave function. In fact, were cloning possible one could — by making 
many copies — measure the wave function by performing suitable measurements on the various 
copies. After all, any wave function ijj is determined by Aip) for sufficiently many observables 
A and these expectation values can of course be computed using a sufficiently large ensemble. 

By a deterministic quantity we mean any function ^ = f{ip) of the wave function alone (which 
thus does not inherit any irreducible randomness associated with the random configuration X). It 
follows easily from ()7.6|1 that no (nontrivial) deterministic quantity is measurable. In particular, 
the mean value Aip) of an observable A (not a multiple of the identity) is not measurable — 
though it would be were it possible to copy the wave function, and it can of course be measured 
by a nonlinear experiment, see Section [731 

7.3 Initial Values and Final Values 

Measurement is a tricky business. In particular, one may wonder how, if it is not measurable, we 
are ever able to know the wave function of a system — which in orthodox quantum theory often 
seems to be the only thing that we do know about it. 

In this regard, it is important to appreciate that we were concerned in the previous section 
only with initial values, with the wave function and the velocity prior to the measurement. We 
shall now briefly comment upon the measurability of flnal values, produced by the experiment. 

The nonmeasurability argument of Section 17.21 does not cover flnal values. This may be 
appreciated by noting that the crucial ingredient in the analysis involves a fundamental time- 
asymmetry: The probability distribution fi^ of the result of an experiment is a quadratic func- 
tional of the initial wave function ip, not the flnal one — of which it is not a functional at all. 
Moreover, the flnal velocity can indeed be measured, by a momentum measurement as described 
in Section 15.51 (That such a measurement yields also the flnal velocity follows from the formula 
in footnote I2H1 for the asymptotic wave function.) And the flnal wave function can be measured 

35Note also that /x^ (^A) = d{X - f{'4'))dX seems manifestly nonquadratic in ip (unless / is constant). 
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by an ideal measurement of any nondegenerate observable, and more generally by any strong 
formal measurement whose subspaces Tia are one-dimensional, see Section 231 If the outcome is 
a, the final wave function is Ra'4' = RaP'Ha'4'i which is independent of the initial wave function 
■0 (up to a scalar multiple). 

We also wish to remark that this distinction between measurements of initial values and 
measurements of final values has no genuine significance for passive measurements, that merely 
reveal preexisting properties without in any way affecting the measured system. However, quan- 
tum measurements are usually active; for example, an ideal measurement transforms the wave 
function of the system into an eigenstate of the measured observable. But passive or active, a 
measurement, by its very meaning, is concerned strictly speaking with properties of a system 
just before its performance, i.e., with initial values. At the same time, to the extent that any 
property of a system is conveyed by a typical quantum "measurement," it is a property defined 
by a final value. 

For example, according to orthodox quantum theory a position measurement on a particle 
with a spread-out wave function, to the extent that it measures anything at all, measures the 
final position of the particle, created by the measurement, rather than the initial position, which 
is generally regarded as not existing prior to the measurement. And even in Bohmian mechanics, 
in which such a measurement may indeed reveal the initial position, which — if the measurement 
is suitably performed — will agree with the final position, this measurement will still be active 
since the wave function of the system must be transformed by the measurement into one that is 
compatible with the sharper knowledge of the position that it provides, see Section 2.1. 

7.4 Nonlinear Measurements and the Role of Prior Information 

The basic idea of measurement is predicated on initial ignorance. We think of a measurement of 
a property of a system as conveying that property by a procedure that does not seriously depend 
upon the state of the system,^^ any details of which must after all be unknown prior to at least 
some engagement with the system. Be that as it may, the notion of measurement as codified 
by the quantum formalism is indeed rooted in a standpoint of ignorance: the experimental 
procedures involved in the measurement do not depend upon the state of the measured system. 
And our entire discussion of measurement up to now has been based upon that very assumption, 
that S itself does not depend on ifj (and certainly not on X). 

If, however, some prior information on the initial system wave function ip were available, we 
could exploit this information to measure quantities that would otherwise fail to be measurable. 
For example, for a single-particle system, if we somehow knew its initial wave function if) then 
a measurement of the initial position of the particle would convey its initial velocity as well, 
via ()7.3p — even though, as we have shown, this quantity isn't measurable without such prior 
information. 

By a nonlinear measurement or experiment S' = we mean one in which, unlike those 
considered so far, one or more of the defining characteristics of the experiment depends upon if). 

■^^This statement must be taken with a grain of salt. Some things must be known about the system prior to 
measurement, for example, that it is in the vicinity the measurement apparatus, or that an atom whose angular 
momentum we wish to measure is moving towards the relevant Stern Gerlach magnets, as well as a host of similar, 
often unnoticed, pieces of information. This sort of thing does not much matter for our purposes in this paper 
and can be safely ignored. Taking them into account would introduce pointless complications without affecting 
the analysis in an essential way. 
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For example, in the measurement of the initial velocity described in the previous paragraph, the 
calibration function F = F"^ depends upon ip.^'^ More generally we mig ht have that U = U'f' or 

The wave function can of course be measured by a nonlinear measurement — ^just let F^ = tp. 
Somewhat less trivially, the initial wave function can be measured, at least formally, if it is 
known to be a member of a given orthonormal basis, by measuring any nondegenerate observable 
whose eigenvectors form that basis. The proposals of Aharonov, Anandan and Vaidman pp 
for measuring the wave function, though very interesting, are of this character — they involve 
nonlinear measurements that depend upon a choice of basis containing ip — and thus remain 
controversial.^^ 



7.5 A Position Measurement that Does not Measure Position 

We began this section by observing that what is spoken of as a measurement in quantum theory 
need not really measure anything. We mentioned, however, that in Bohmian mechanics the 
position can be measured, and the experiment that accomplishes this would of course be a 
measurement of the position operator. We wish here to point out, by means of a very simple 
example, that the converse is not true, i.e., that a measurement of the position operator need 
not be a measurement of the position. 

Consider the harmonic oscillator in 2 dimensions with Hamiltonian 




Except for an irrelevant time-dependent phase factor, the evolution tpt is periodic, with period 
r = 271 /uj. The Bohm motion of the particle, however, need not have period r. For example, the 
[n = l,m = l)-state, which in polar coordinates is of the form 

0) = ^re-^^'e^^e-^i-*, (7.7) 

generates a circular motion of the particle around the origin with angular velocity h/{mr'^), and 
hence with periodicity depending upon the initial position of the particle — the closer to the origin, 
the faster the rotation. Thus, in general, 

Xr 7^ Xq. 

Nonetheless, and Xq are identically distributed random variables, since |'?/'rP = l^oP = l"^!^- 
We may now focus on two different experiments: Let ^ be a measurement of the actual 
position Xq, the initial position, and hence of the position operator, and let be an experiment 
beginning at the same time as <f but in which it is the position X^ at time r that is actually 
measured. Since for all ip the result of (f ' has the same distribution as the result of S' is also a 
measurement of the position operator. But S' is not a measurement of the initial position since 

■^^Suppose that Z\ = Fi{Qt) — X is the result of the measurement of the initial position. Then F''^ — o Fi 
where G^{-) = 

^*In one of their proposals the wave function is "protected" by a procedure that depends upon the basis; in 
another, involving adiabatic interactions, ip must be a nondegenerate eigenstate of the Hamiltonian H of the 
system, but it is not necessary that the latter be known. 
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the position at time r does not in general agree with the initial position: A measurement of the 
position at time r is not a measurement of the position at time 0. Thus, while a measurement 
of position is always a measurement of the position operator, 

A measurement of the position operator is not necessarily a genuine measurement of 
position! 

7.6 Theory Dependence of Measurement 

The harmonic oscillator example provides a simple illustration of an elementary point that is 
often ignored: in discussions of measurement it is well to keep in mind the theory under consid- 
eration. The theory we have been considering here has been Bohmian mechanics. If, instead, 
we were to analyze the harmonic oscillator experiments described above using different theories 
our conclusions about results of measurements would in general be rather different, even if the 
different theories were empirically equivalent. So we shall analyze the above experiment S'' in 
terms of various other formulations or interpretations of quantum theory. 

In strict orthodox quantum theory there is no such thing as a genuine particle, and thus there 
is no such thing as the genuine position of a particle. There is, however, a kind of operational 
definition of position, in the sense of an experimental setup, where a measurement device yields 
results the statistics of which are given by the position operator. 

In naive orthodox quantum theory one does speak loosely about a particle and its position, 
which is thought of — in a somewhat uncritical way — as being revealed by measuring the position 
operator. Any experiment that yields statistics given by the position operator is considered a 
genuine measurement of the particle's position. Thus S'' would be considered as a measurement 
of the position of the particle at time zero. 

The decoherent (or consistent) histories formulation of quantum mechanics [SSI lEHl SEj is 
concerned with the probabilities of certain coarse-grained histories, given by the specification 
of finite sequences of events, associated with projection operators, together with their times 
of occurrence. These probabilities are regarded as governing the occurrence of the histories, 
regardless of whether any of the events are measured or observed, but when they are observed, 
the probabilities of the observed histories are the same as those of the unobserved histories. The 
experiments S' and tf' are measurements of single-event histories corresponding to the position 
of the particle at time and at time r, respectively. Since the Heisenberg position operators 
Xt- = Xq for the harmonic oscillator, it happens to be the case, according to the decoherent 
histories formulation of quantum mechanics, that for this system the position of the particle at 
time r is the same as its position at time when the positions are unobserved, and that <§'' in 
fact measures the position of the particle at time (as well as the position at time r). 

The spontaneous localization or dynamical reduction models pHj HO] are versions of quantum 
theory in which there are no genuine particles; in these theories reality is represented by the 
wave function alone (or, more accurately, by entities entirely determined by the wave function). 
In these models Schrodinger's equation is modified by the addition of a stochastic term that 
causes the wave function to collapse during measurement in a manner more or less consistent 
with the quantum formalism. In particular, the performance of ^ or (S'' would lead to a random 
collapse of the oscillator wave function onto a narrow spatial region, which might be spoken of 

■^^This, and the failure to appreciate the theory dependence of measurements, has been a source of unfounded 
criticisms of Bohmian mechanics(see 1241 1^ ^. 
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as the position of the particle at the relevant time. But S' could not be regarded in any sense as 
measuring the position at time 0, because the localization does not occur for S' until time r. 



Finally we mention stochastic mechanics |64j, a theory ontologically very similar to Bohmian 

mechanics in that the basic entities with which it is concerned are particles described by their 
positions. Unlike Bohmian mechanics, however, the positions evolve randomly, according to a 
diffusion process. Just as with Bohmian mechanics, for stochastic mechanics the experiment S' 
is not a measurement of the position at time zero, but in contrast to the situation in Bohmian 
mechanics, where the result of the position measurement at time r determines, given the wave 
function, the position at time zero (via the Bohmian equation of motion), this is not so in 
stochastic mechanics because of the randomness of the motion. 

8 Hidden Variables 

The issue of hidden variables concerns the question of whether quantum randomness arises in a 
completely ordinary manner, merely from the fact that in orthodox quantum theory we deal with 
an incomplete description of a quantum system. According to the hidden-variables hypothesis, 
if we had at our disposal a sufficiently complete description of the system, provided by supple- 
mentary parameters traditionally called hidden variables, the totality of which is usually denoted 
by A, the behavior of the system would thereby be determined, as a function of A (and the 
wave function). In such a hidden- variables theory, the randomness in results of measurements 
would arise solely from randomness in the unknown variables A. On the basis of a variety of 
"impossibility theorems," the hidden-variables hypothesis has been widely regarded as having 
been discredited. 

Note that Bohmian mechanics is just such a hidden- variables theory, with the hidden variables 
A given by the configuration Q of the total system. We have seen in particular that in a Bohmian 
experiment, the result Z is determined by the initial configuration Q = {X, Y) of the system and 
apparatus. Nonetheless, there remains much confusion about the relationship between Bohmian 
mechanics and the various theorems supposedly establishing the impossibility of hidden variables. 
In this section we wish to make several comments on this matter. 

8.1 Experiments and Random Variables 

In Bohmian mechanics we understand very naturally how random variables arise in association 
with experiments: the initial complete state {Q, \&) of system and apparatus evolves deterministi- 
cally and uniquely determines the outcome of the experiment; however, as the initial configuration 
Q is in quantum equilibrium, the outcome of the experiment is random. 

A general experiment S' is then always associated a random variable (RV) Z describing its 
result. In other words, according to Bohmian mechanics, there is a natural association 



between experiments and RVs. Moreover, whenever the statistics of the result of (ff is governed 
by a self-adjoint operator A on the Hilbert space of the system, with the spectral measure of 
A determining the distribution of Z, for which we shall write Z A (see (I2.37|) ). Bohmian 
mechanics establishes thereby a natural association between S' and A 




S ^ Z, 




^ v-^ A. 
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While for Bohmian mechanics the result Z depends in general on both X and F, the initial 
configurations of the system and of the apparatus, for many real-world experiments Z depends 
only on X and the randomness in the result of the experiment is thus due solely to randomness in 
the initial configuration of the system alone. This is most obvious in the case of genuine position 
measurements (for which Z{X^ Y) = X). That in fact the apparatus need not introduce any extra 
randomness for many other real- world experiments as well follows then from the observation that 
the role of the apparatus in many real- world experiments is to provide suitable background fields, 
which introduce no randomness, as well as a final detection, a measurement of the actual positions 
of the particles of the system. In particular, this is the case for those experiments most relevant 
to the issue of hidden variables, such as Stern-Gerlach measurements of spin, as well as for 
momentum measurements and more generally scattering experiments, which are completed by a 
final detection of position. 

The result of these experiments is then given by a random variable 



where T is the final time of the experiment, on the probability space {r2,P}, where Q = {X} 
is the set of initial configurations of the system and ¥{dx) = {ipl'^dx is the quantum equilibrium 
distribution associated with the initial wave function ip of the system. For these experiments 
(see Section 15. 4|) the distribution of Z is always governed by a PVM, corresponding to some 
self-adjoint operator A, Z \—>- A, and thus Bohmian mechanics provides in these cases a natural 
map I— > A. 

8.2 Random Variables, Operators, and the Impossibility Theorems 

We would like to briefiy review the status of the so-called impossibility theorems for hidden 
variables, the most famous of which are due to von Neumann ^74j, Gleason |lTj, Kochen and 
Specker jHSl, and Bell jH]. Since Bohmian mechanics exists, these theorems can't possibly estab- 
lish the impossibility of hidden variables, the widespread belief to the contrary notwithstanding. 
What these theorems do establish, in great generality, is that there is no ^^good" map from self- 
adjoint operators on a Hilbert space 7i to random variables on a common probability space. 



where Za = Za{\) should be thought of as the result of "measuring A" when the hidden variables, 
that complete the quantum description and restore determinism, have value A. Different senses 
of "good" correspond to different impossibility theorems. 

For any particular choice of A, say Aq, the map (j8.1(J|) is transformed to a value map 



Concerning the most common of all real-world quantum experiments, scattering experiments, although they 
are completed by a final detection of position, this detection usually occurs, not at a definite time T, but at a 
random time, for example when a particle enters a localized detector. Nonetheless, for computational purposes 
the final detection can be regarded as taking place at a definite time T . This is a consequence of the flux-across- 
surfaces theorem ^1|2S1EZ1, which establishes an asymptotic equivalence between flux across surfaces (detection 
at a random time) and scattering into cones (detection at a definite time). 



Z = F{Xt) = G{X) , 



A^ Z 



Za 




A v{A) 
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from self- adjoint operators to real numbers (with v{A) = Z^(Ao)). The stronger impossibility 
theorems establish the impossibility of a good value map, again with different senses of "good" 
corresponding to different theorems. 

Note that such theorems are not very surprising. One would not expect there to be a "good" 
map from a noncommutative algebra to a commutative one. 

One of von Neumann's assumptions was, in effect, that the map ()8.10p be linear. While 
mathematically natural, this assumption is physically rather unreasonable and in any case is 
entirely unnecessary. In order to establish that there is no good map ()8.1()j) . it is sufficient to 
require that the map be good in the minimal sense that the following agreement condition is 
satisfied: 

Whenever the quantum mechanical joint distribution of a set of self- adjoint opera- 
tors {Ai, . . . , Am) exists, i.e., when they form a commuting family, the joint distribu- 
tion of the corresponding set of random variables, i.e., of [Z^^^ . . . , ^a^); agrees with 
the quantum mechanical joint distribution. 

The agreement condition implies that all deterministic relationships among commuting ob- 
servables must be obeyed by the corresponding random variables. For example, if A, B and C 
form a commuting family and C = AB, then we must have that Zc = Z^Zb since the joint 
distribution of Z^, Zb and Zq must assign probability to the set {(a,6, c) G M^jc 7^ ah}. 
This leads to a minimal condition for a good value map A t— > v{A), namely that it preserve 
functional relationships among commuting observables: For any commuting family Ai, . . . , A^, 
whenever f{Ai,...,Am) = (where / : R™' — >■ M represents a linear, multiplicative, or any 
other relationship among the A's), the corresponding values must satisfy the same relationship, 
fiviA^),...,viAm)) = 0. 

The various impossibility theorems correctly demonstrate that there are no maps, from self- 
adjoint operators to random variables or to values, that are good, merely in the minimal senses 
described above. 

We note that while the original proofs of the impossibility of a good value map, in particular 
that of the Kochen-Specker theorem, were quite involved, in more recent years drastically simpler 
proofs have been found (for example, by Peres ^67j, by Greenberg, Horne, and Zeilinger and 
by Mermin [021 )• 

In essence, one establishes the impossibility of a good map A Z^ ot A ^-^ v{A) by show- 
ing that the v{A)^s, or Z^'s, would have to satisfy impossible relationships. These impossible 
relationships are very much like the following: Z^ = Zb = Zc 7^ Z^- However no impossible 
relationship can arise for only three quantum observables, since they would have to form a com- 
muting family, for which quantum mechanics would supply a joint probability distribution. Thus 
the quantum relationships can't possibly lead to an inconsistency for the values of the random 
variables in this case. 

With four observables A, S, C, and D it may easily happen that [A, -B] = 0, C] = 0, 
[C, = 0, and [-D, A] = even though they don't form a commuting family (because, say, 
[A, C] 7^ 0). It turns out, in fact, that four observables suffice for the derivation of impossible 

'''-'^ Another natural sense of good map A v{A) is given by the requirement that t'(A) e sp(A), where 
A = {A\, . . . , Am) is a commuting family, v{A) — {v{Ai), . . . , v{Am)) G R™ and sp (A) is the joint spectrum of 
the family. That a map good in this sense is impossible follows from the fact that if a = (ai, . . . am) G sp (A), 
then ai, . . . a™ must obey all functional relationships for Ai, . . . , Am- 
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quantum relationships. Perhaps the simplest example of this sort is due to Hardy who 
showed that for almost every quantum state for two spin 1/2 particles there are four observables 
A, B, C, and D (two of which happen to be spin components for one of the particles while 
the other two are spin components for the other particle) whose quantum mechanical pair-wise 
distributions for commuting pairs are such that a good map to random variables must yield 
random variables Za, Zb, Zc, and Z^ obeying the following relationships: 

(1) The event {Za = 1 and Zb = 1} has positive probability (with an optimal choice of the 
quantum state, about .09). 

(2) If {Za = 1} then {Zd = 1}. 

(3) If {Zb = 1} then {Zc = 1}. 

(4) The event {Zd = 1 and Zc = 1} has probability 0. 

Clearly, there exist no such random variables. 

The point we wish to emphasize here, however, is that although they are correct and although 
their hypotheses may seem minimal, these theorems are nonetheless far less relevant to the 
possibility of a deterministic completion of quantum theory than one might imagine. In the 
next subsection we will elaborate on how that can be so. We shall explain why we believe such 
theorems have little physical significance for the issues of determinism and hidden variables. We 
will separately comment later in this section on Bell's related nonlocality analysis jH], which does 
have profound physical implications. 

8.3 Contextuality 

It is a simple fact there can be no map A ^-^ Za, from self-adjoint operators on Ti. (with dim (7i) > 
3) to random variables on a common probability space, that is good in the minimal sense that the 
joint probability distributions for the random variables agree with the corresponding quantum 
mechanical distributions, whenever the latter ones are defined. But does not Bohmian mechanics 
yield precisely such a map? After all, have we not emphasized how Bohmian mechanics naturally 
associates with any experiment a random variable Z giving its result, in a manner that is in 
complete agreement with the quantum mechanical predictions for the result of the experiment? 
Given a quantum observable A, let Za be then the result of a measurement of A. What gives? 

Before presenting what we believe to be the correct response, we mention some possible re- 
sponses that are off-target. It might be objected that measurements of different observables will 
involve different apparatuses and hence different probability spaces. However, one can simul- 
taneously embed all the relevant probability spaces into a huge common probability space. It 
might also be objected that not all self-adjoint operators can be realistically be measured. But 
to arrive at inconsistency one need consider, as mentioned in the last subsection, only 4 observ- 
ables, each of which are spin components and are thus certainly measurable, via Stern-Gerlach 
experiments. Thus, in fact, no enlargement of probability spaces need be considered to arrive at 
a contradiction, since as we emphasized at the end of Section IHITI the random variables giving the 
results of Stern-Gerlach experiments are functions of initial particle positions, so that for joint 
measurements of pairs of spin components for 2-particles the corresponding results are random 
variables on the common probability space of initial configurations of the 2 particles, equipped 
with the quantum equilibrium distribution determined by the initial wave function. 
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There must be a mistake. But where could it be? The mistake occurs, in fact, so early that it 
is difficult to notice it. It occurs at square one. The difficulty lies not so much in any conditions 
on the map A i— > Za, but in the conclusion that Bohmian mechanics supplies such a map at all. 

What Bohmian mechanics naturally supplies is a map S' i— > Z^, from experiments to random 
variables. When i— > A, so that we speak of # as a measurement of ^4 ^ A), this very 
language suggests that insofar as the random variable is concerned all that matters is that S' 
measures A, and the map Z£> becomes a map A i— Za- After all, if S' were a genuine 

measurement of A, revealing, that is, the preexisting (i.e., prior to the experiment) value of the 
observable A, then Z would have to agree with that value and hence would be an unambiguous 
random variable depending only on A. 

But this sort of argument makes sense only if we take the quantum talk of operators as 
observables too seriously. We have emphasized in this paper that operators do naturally arise in 
association with quantum experiments. But there is little if anything in this association, beyond 
the unfortunate language that is usually used to describe it, that supports the notion that the 
operator A associated with an experiment S' is in any meaningful way genuinely measured by 
the experiment. Prom the nature of the association itself, it is difficult to imagine what this 
could possibly mean. And for those who think they imagine some meaning in this talk, the 
impossibility theorems show they are mistaken. 

The bottom line is this: in Bohmian mechanics the random variables Z^ giving the results of 
experiments S' depend, of course, on the experiment, and there is no reason that this should not 
be the case when the experiments under consideration happen to be associated with the same 
operator. Thus with any self- adjoint operator A. Bohmian mechanics naturally may associate 
many different random variables Z^, one for each different experiment ^ A associated with 
A. A crucial point here is that the map S" ^ A'ls many-to-one.^^ 

Suppose we define a map A ^ Za by selecting, for each A, one of the experiments, call 
it S'ai with which A is associated, and define Za to be Z ^ ^. Then the map so defined can't 
be good, because of the impossibility theorems; moreover there is no reason to have expected 
the map to be good. Suppose, for example, that [^4, B] — 0. Should we expect that the joint 
distribution of Za and Zb will agree with the joint quantum mechanical distribution of A and 
Bl Only if the experiments a and S'b used to define Za and Zb both involved a common 
experiment that "simultaneously measures A and i?," i.e., an experiment that is associated with 
the commuting family (A, 5). If we consider now a third operator C such that [A,C] = 0, but 
[5, C] 7^ 0, then there is no choice of experiment # that would permit the definition of a random 
variable Za relevant both to a "simultaneous measurement of A and B" and a "simultaneous 
measurement of A and C" since no experiment is a "simultaneous measurement of A, B, and 
C." In the situation just described we must consider at least two random variables associated 
with A, Za,b and Za,c, depending upon whether we are considering an experiment "measuring 
A and B" or an experiment "measuring A and C." It should be clear that when the random 
variables are assigned to experiments in this way, the possibility of conflict with the predictions 
of orthodox quantum theory is eliminated. It should also be clear, in view of what we have 

^^We wish to remark that, quite aside from this many-to-oneness, the random variables Z^i cannot generally be 
regarded as corresponding to any sort of natural property of the "measured" system. Z^i, in general a function 
of the initial configuration of the system-apparatus composite, may fail to be a function of the configuration of 
the system alone. And even when, as is often the case, Z^i does depend only on the initial configuration of the 
system, owing to chaotic dynamics this dependence could have an extremely complex character. 
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repeatedly stressed, that quite aside from the impossibihty theorems, this way of associating 
random variables with experiments is precisely what emerges in Bohmian mechanics. 

The dependence of the result of a "measurement of the observable A" upon the other ob- 
servables, if any, that are "measured simultaneously together with A" — e.g., that Za,b and Za,c 
may be different — is called contextuality: the result of an experiment depends not just on "what 
observable the experiment measures" but on more detailed information that conveys the "con- 
text" of the experiment. The essential idea, however, if we avoid misleading language, is rather 
trivial: that the result of an experiment depends on the experiment. 

To underline this triviality we remark that for two experiments, and i^', that "measure 
A and only A" and involve no simultaneous "measurement of another observable," the results 
Z^ and Z ^1 may disagree. For example in Section 1731 we described experiments (S and S' both 
of which "measured the position operator" but only one of which measured the actual initial 
position of the relevant particle, so that for these experiments in general Z ^ Z gi . 

One might feel, however, that in the example just described the experiment that does not 
measure the actual position is somewhat disreputable — even though it is in fact a "measurement 
of the position operator." We shall therefore give another example, due to D. Albert 2j, in which 
the experiments are as simple and canonical as possible and are entirely on the same footing. 
Let and S ^ be Stern-Gerlach measurements of A = a^, with ^ differing from only in that 
the polarity of the Stern-Gerlach magnet for S ^ is the reverse of that for S^. (In particular, the 
geometry of the magnets for and S ^ is the same.) If the initial wave function '?/'symm and the 
magnetic field ±S have sufficient reflection symmetry with respect to a plane between the poles 
of the Stern-Gerlach magnets, the particle whose spin component is being "measured" cannot 
cross this plane of symmetry, so that if the particle is initially above, respectively below, the 
symmetry plane, it will remain above, respectively below, that plane. But because their magnets 
have opposite polarity, and S \^ involve opposite calibrations: F| = —F^. It follows that 

^ <? — 

and the two experiments completely disagree about the "value of a^" in this case. 

The essential point illustrated by the previous example is that instead of having in Bohmian 
mechanics a natural association ^ Z^,^^ we have a rather different pattern of relationships, 
given in the example by 

8.4 Against "Contextuality" 

The impossibility theorems require the assumption of noncontextuality, that the random variable 
Z giving the result of a "measurement of quantum observable A" should depend on A alone, 
further experimental details being irrelevant. How big a deal is contextuality, the violation of 
this assumption? Here are two ways of describing the situation: 

1. In quantum mechanics (or quantum mechanics supplemented with hidden variables), ob- 
servables and properties have a novel, highly nonclassical aspect: they (or the result of 
measuring them) depend upon which other compatible properties, if any, are measured 
together with them. 

In this spirit, Bohm and Hiley ^Hl write that (page 109) 
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the quantum properties imply . . . that measured properties are not intrinsic but 
are inseparably related to the apparatus. It follows that the customary language 
that attributes the results of measurements ... to the observed system alone can 
cause confusion, unless it is understood that these properties are actually depen- 
dent on the total relevant context. 

They later add that (page 122) 

The context dependence of results of measurements is a further indication of 
how our interpretation does not imply a simple return to the basic principles 
of classical physics. It also embodies, in a certain sense, Bohr's notion of the 
indivisibility of the combined system of observing apparatus and observed object. 

2. The result of an experiment depends upon the experiment. Or, as expressed by Bell |10j 
(pg.l66), 

A final moral concerns terminology. Why did such serious people take so seri- 
ously axioms which now seem so arbitrary? I suspect that they were misled by 
the pernicious misuse of the word 'measurement' in contemporary theory. This 
word very strongly suggests the ascertaining of some preexisting property of some 
thing, any instrument involved playing a purely passive role. Quantum exper- 
iments are just not like that, as we learned especially from Bohr. The results 
have to be regarded as the joint product of 'system' and 'apparatus,' the com- 
plete experimental set-up. But the misuse of the word 'measurement' makes it 
easy to forget this and then to expect that the 'results of measurements' should 
obey some simple logic in which the apparatus is not mentioned. The resulting 
difficulties soon show that any such logic is not ordinary logic. It is my impres- 
sion that the whole vast subject of 'Quantum Logic' has arisen in this way from 
the misuse of a word. I am convinced that the word 'measurement' has now 
been so abused that the field would be significantly advanced by banning its use 
altogether, in favour for example of the word 'experiment.' 

With one caveat, we entirely agree with Bell's observation. The caveat is this: We do not 
believe that the difference between quantum mechanics and classical mechanics is quite as crucial 
for Bell's moral as his language suggests it is. For any experiment, quantum or classical, it would 
be a mistake to regard any instrument involved as playing a purely passive role, unless the 
experiment is a genuine measurement of a property of a system, in which case the result is 
determined by the initial conditions of the system alone. However, a relevant difference between 
classical and quantum theory remains: Classically it is usually taken for granted that it is in 
principle possible to measure any observable without seriously affecting the observed system, 
which is clearly false in quantum mechanics (or Bohmian mechanics). 

Mermin has raised a similar question [62 (pg. 811): 

Is noncontextuality, as Bell seemed to suggest, as silly a condition as von Neu- 
mann's . . . ? 

To this he answers: 

■*^The assumption could (and probably should) also be questioned classically. 
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I would not characterize the assumption of noncontextuahty as a silly constraint on 
a hidden-variables theory. It is surely an important fact that the impossibility of 
embedding quantum mechanics in a noncontextual hidden-variables theory rests not 
only on Bohr's doctrine of the inseparability of the objects and the measuring instru- 
ments, but also on a straightforward contradiction, independent of one's philosophic 
point of view, between some quantitative consequences of noncontextuahty and the 
quantitative predictions of quantum mechanics. 

This is a somewhat strange answer. First of all, it applies to von Neumann's assumption (lin- 
earity), which Mermin seems to agree is silly, as well as to the assumption of noncontextuahty. 
And the statement has a rather question-begging flavor, since the importance of the fact to 
which Mermin refers would seem to depend on the nonsilliness of the assumption which the fact 
concerns. 

Be that as it may, Mermin immediately supplies his real argument for the nonsilliness of 
noncontextuahty. Concerning two experiments for "measuring observable A" he writes that 

it is . . . an elementary theorem of quantum mechanics that the joint distribution . . . 

for the first experiment yields precisely the same marginal distribution (for A) as 
does the joint distribution ... for the second, in spite of the different experimental 
arrangements. . . . The obvious way to account for this, particularly when entertaining 
the possibility of a hidden- variables theory, is to propose that both experiments reveal 
a set of values for A in the individual systems that is the same, regardless of which 
experiment we choose to extract them from. ... A contex^uai hidden- variables account 
of this fact would be as mysteriously silent as the quantum theory on the question of 
why nature should conspire to arrange for the marginal distributions to be the same 
for the two different experimental arrangements. 

A bit later, Mermin refers to the "striking insensitivity of the distribution to changes in the 
experimental arrangement." 

For Mermin there is a mystery, something that demands an explanation. It seems to us, 
however, that the mystery here is very much in the eye of the beholder. It is first of all somewhat 
odd that Mermin speaks of the mysterious silence of quantum theory concerning a question whose 
answer, in fact, emerges as an "elementary theorem of quantum mechanics." What better way 
is there to answer questions about nature than to appeal to our best physical theories? 

More importantly, the "two different experimental arrangements," say S'l and S'2, considered 
by Mermin are not merely any two randomly chosen experimental arrangements. They obviously 
must have something in common. This is that they are both associated with the same self-adjoint 
operator A in the manner we have described: S'l A and S'2 ^ A. It is quite standard to say in 
this situation that both Si and S2 measure the observable A, but both for Bohmian mechanics 
and for orthodox quantum theory the very meaning of the association with the operator A is 
merely that the distribution of the result of the experiment is given by the spectral measures 
for A. Thus there is no mystery in the fact that Si and S2 have results governed by the same 
distribution, since, when all is said and done, it is on this basis, and this basis alone, that we are 
comparing them. 

(One might wonder how it could be possible that there are two different experiments that are 
related in this way. This is a somewhat technical question, rather different from Mermin's, and it 

is one that Bohmian mechanics and quantum mechanics readily answer, as we have explained in 
this paper. In this regard it would probably be good to reflect further on the simplest example of 
such experiments, the Stern-Gerlach experiments Si and Si discussed in the previous subsection.) 
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It is also difficult to see how Mermin's proposed resolution of the mystery, "that both experi- 
ments reveal a set of values for A . . . that is the same, regardless of which experiment we choose 
to extract them from," could do much good. He is faced with a certain pattern of results in two 
experiments that would be explained if the experiments did in fact genuinely measure the same 
thing. The experiments, however, as far as any detailed quantum mechanical analysis of them 
is concerned, don't appear to be genuine measurements of anything at all. He then suggests 
that the mystery would be resolved if, indeed, the experiments did measure the same thing, the 
analysis to the contrary notwithstanding. But this proposal merely replaces the original mystery 
with a bigger one, namely, of how the experiments could in fact be understood as measuring the 
same thing, or anything at all for that matter. It is like explaining the mystery of a talking cat 
by saying that the cat is in fact a human being, appearances to the contrary notwithstanding. 

A final complaint about contextuality: the terminology is misleading. It fails to convey with 
sufficient force the rather definitive character of what it entails: "Properties" that are merely 
contextual are not properties at all; they do not exist, and their failure to do so is in the strongest 
sense possible! 



8.5 Nonlocality, Contextuality and Hidden Variables 

There is, however, a situation where contextuality is physically relevant. Consider the EPRB 
experiment, outlined at the end of Section IH.fil In this case the dependence of the result of a 
measurement of the spin component cti • a of a particle upon which spin component of a distant 
particle is measured together with it — the difference between ^'o-^.a, 0-2 b and Za-^ a^ 0-2 c (using the 
notation described in the seventh paragraph of Section 18. 3|) — is an expression of nonlocality, of, 
in Einstein words, a "spooky action at distance." More generally, whenever the relevant context 
is distant, contextuality implies nonlocality. 

Nonlocality is an essential feature of Bohmian mechanics: the velocity, as expressed in the 
guiding equation ()2.2|) . of any one of the particles of a many-particle system will typically depend 
upon the positions of the other, possibly distant, particles whenever the wave function of the 
system is entangled, i.e., not a product of single-particle wave functions. In particular, this is 
true for the EPRB experiment under examination. Consider the extension of the single particle 
Hamiltonian ()2.12|) to the two-particle case, namely 

H = - - /iicri-B(xi) - /i2cr2-B(x2). 

2mi 2m2 

Then for initial singlet state, and spin measurements as described in Sections 12.51 and 15.21 it 
easily follows from the laws of motion of Bohmian mechanics that 

•^(Ti -a, <T2 b 7^ ^cr\ a, <T2 C • 

This was observed long ago by Bell |j6j. In fact. Bell's examination of Bohmian mechanics 
led him to his celebrated nonlocality analysis. In the course of his investigation of Bohmian 
mechanics he observed that ([TU], p. 11) 

in this theory an explicit causal mechanism exists whereby the disposition of one piece 
of apparatus affects the results obtained with a distant piece. 
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Bohm of course was well aware of these features of his scheme, and has given them 
much attention. However, it must be stressed that, to the present writer's knowledge, 
there is no proof that any hidden variable account of quantum mechanics must have 
this extraordinary character. It would therefore be interesting, perhaps, to pursue 
some further "impossibility proofs," replacing the arbitrary axioms objected to above 
by some condition of locality, or of separability of distant systems. 

In a footnote. Bell added that "Since the completion of this paper such a proof has been found." 
This proof was published in his 1964 paper [SI, "On the Einstein-Podolsky- Rosen Paradox," in 
which he derives Bell's inequality, the basis of his conclusion of quantum nonlocality. 

We find it worthwhile to reproduce here the analysis of Bell, deriving a simple inequality 
equivalent to Bell's, in order to highlight the conceptual significance of Bell's analysis and, at 
the same time, its mathematical triviality. The analysis involves two parts. The first part, the 
Einstein-Podolsky- Rosen argument applied to the EPRB experiment, amounts to the observation 
that for the singlet state the assumption of locality implies the existence of noncontextual hidden 
variables. More precisely, it implies, for the singlet state, the existence of random variables 

= Za(Ti, ^ = 1,2, corresponding to all possible spin components of the two particles, that 
obey the agreement condition described in Section IH^ In particular, focusing on components in 
only 3 directions a, b and c for each particle, locality implies the existence of 6 random variables 

i = 1, 2 CK = a, b, c 

such that 

ZL = ±1 (8.12) 

= -Zl (8.13) 

and, more generally, 

Prob(Zi ^ Zl) = (8.14) 

the corresponding quantum mechanical probabilities. This conclusion amounts to the idea that 
measurements of the spin components reveal preexisting values (the Z^^), which, assuming locality, 
is implied by the perfect quantum mechanical anticorrelations jH]: 

Now we make the hypothesis, and it seems one at least worth considering, that if the 
two measurements are made at places remote from one another the orientation of one 
magnet does not influence the result obtained with the other. Since we can predict in 
advance the result of measuring any chosen component of (72, by previously measuring 
the same component of ai, it follows that the result of any such measurement must 
actually be predetermined. 

People very often fail to appreciate that the existence of such variables, given locality, is not 
an assumption but a consequence of Bell's analysis. Bell repeatedly stressed this point (by 
determinism Bell here means the existence of hidden variables) : 

It is important to note that to the limited degree to which determinism plays a 
role in the EPR argument, it is not assumed but inferred. What is held sacred is the 
principle of 'local causality' - or 'no action at a distance'. . . . 

It is remarkably difficult to get this point across, that determinism is not a pre- 
supposition of the analysis. (^iQj, p. 143) 

Despite my insistence that the determinism was inferred rather than assumed, you 
might still suspect somehow that it is a preoccupation with determinism that creates 
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the problem. Note well then that the following argument makes no mention whatever 
of determinism. . . . Finally you might suspect that the very notion of particle, and 
particle orbit . . . has somehow led us astray. ... So the following argument will 
not mention particles, nor indeed fields, nor any other particular picture of what 
goes on at the microscopic level. Nor will it involve any use of the words 'quantum 
mechanical system', which can have an unfortunate effect on the discussion. The 
difficulty is not created by any such picture or any such terminology. It is created 
by the predictions about the correlations in the visible outputs of certain conceivable 
experimental set-ups. ([in], P- 150) 

The second part of the analysis, which unfolds the "difficulty . . . created by the . . . correla- 
tions," involves only very elementary mathematics. Clearly, 

Prob {{Zl = Zl} U {Zl = Zl) U {Zl = Zi}) = 1 . 

since at least two of the three (2-valued) variables must have the same value. Hence, by 
elementary probability theory, 

Prob {Zl = Zl) + Prob (Z^ = Zl) + Prob {Zl = Z^) > 1, 

and using the perfect anticorrelations ()8.13|) we have that 

Prob {Zl = -Zl) + Prob (Z^ = -Z^) + Prob {Z^ = -Zl) > 1, (8.15) 

which is equivalent to Bell's inequality and in conflict with fl8.14j) . For example, when the angles 
between a, b and c are 120° the 3 relevant quantum correlations gct/3 sue all 1/4. 

To summarize the argument, let H be the hypothesis of the existence of the noncontextual 
hidden variables we have described above. Then the logic of the argument is: 

Parti: quantum mechanics -|- locality =^ H (8.16) 

Part 2: quantum mechanics =^ not H (8-17) 

Conclusion: quantum mechanics =^ not locality (8.18) 

To fully grasp the argument it is important to appreciate that the identity of H — the existence 
of the noncontextual hidden variables — is of little substantive importance. What is important is 
not so much the identity of H as the fact that H is incompatible with the predictions of quantum 
theory. The identity of H is, however, of great historical significance: It is responsible for the 
misconception that Bell proved that hidden variables are impossible, a belief until recently almost 
universally shared by physicists. 

Such a misconception has not been the only reaction to Bell's analysis. Roughly speaking, 
we may group the different reactions into three main categories, summarized by the following 
statements: 

1. Hidden variables are impossible. 

2. Hidden variables are possible, but they must be contextual. 

3. Hidden variables are possible, but they must be nonlocal. 
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Statement 1 is plainly wrong. Statement 2 is correct but not terribly significant. Statement 3 
is correct, significant, but nonetheless rather misleading. It follow from (j8.16|) and (|8.17|) that 
any account of quantum phenomena must be nonlocal, not just any hidden variables account. 
Bell's argument shows that nonlocality is implied by the predictions of standard quantum theory 
itself. Thus if nature is governed by these predictions, then nature is nonlocal. (That nature is 
so governed, even in the crucial EPR-correlation experiments, has by now been established by a 
great many experiments, the most conclusive of which is perhaps that of Aspect 0.) 

9 Against Naive Realism About Operators 

Traditional naive realism is the view that the world is pretty much the way it seems, populated by 
objects which force themselves upon our attention as, and which in fact are, the locus of sensual 
qualities. A naive realist regards these "secondary qualities," for example color, as objective, 
as out there in the world, much as perceived. A decisive difficulty with this view is that once 
we understand, say, how our perception of what we call color arises, in terms of the interaction 
of light with matter, and the processing of the light by the eye, and so on, we realize that the 
presence out there of color per se would play no role whatsoever in these processes, that is, in 
our understanding what is relevant to our perception of "color." At the same time, we may also 
come to realize that there is, in the description of an object provided by the scientific world-view, 
as represented say by classical physics, nothing which is genuinely "color-like." 

A basic problem with quantum theory, more fundamental than the measurement problem 
and all the rest, is a naive realism about operators, a fallacy which we believe is far more serious 
than traditional naive realism: With the latter we are deluded partly by language but in the 
main by our senses, in a manner which can scarcely be avoided without a good deal of scientific 
or philosophical sophistication; with the former we are seduced by language alone, to accept a 
view which can scarcely be taken seriously without a large measure of (what often passes for) 
sophistication. 

Not many physicists — or for that matter philosophers — have focused on the issue of naive 
realism about operators, but Schrodinger and Bell have expressed similar or related concerns: 

. . . the new theory [quantum theory] . . . considers the [classical] model suitable for 
guiding us as to just which measurements can in principle be made on the relevant 
natural object. . . .Would it not be pre-established harmony of a peculiar sort if the 
classical-epoch researchers, those who, as we hear today, had no idea of what measur- 
ing truly is, had unwittingly gone on to give us as legacy a guidance scheme revealing 
just what is fundamentally measurable for instance about a hydrogen atom!? [72] 

Here are some words which, however legitimate and necessary in application, have no 
place in a formulation with any pretension to physical precision: system; apparatus; 
environment; microscopic, macroscopic; reversible, irreversible; observable; informa- 
tion; measurement. 

. . . The notions of "microscopic" and "macroscopic" defy precise definition. 
. . . Einstein said that it is theory which decides what is "observable" . I think he 
was right. . . . "observation" is a complicated and theory-laden business. Then that 
notion should not appear in the formulation of fundamental theory. . . . 

On this list of bad words from good books, the worst of all is "measurement". It 
must have a section to itself. ^T] 
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We agree almost entirely with Bell here. We insist, however, that "observable" is just as bad 
as "measurement," maybe even a little worse. Be that as it may, after listing Dirac's measurement 
postulates Bell continues: 

It would seem that the theory is exclusively concerned about "results of measure- 
ment", and has nothing to say about anything else. What exactly qualifies some 
physical systems to play the role of "measurer" ? Was the wave function of the world 
waiting to jump for thousands of millions of years until a single-celled living creature 
appeared? Or did it have to wait a little longer, for some better qualified system 
. . .with a Ph.D.? If the theory is to apply to anything but highly ideahzed labora- 
tory operations, are we not obliged to admit that more or less "measurement-like" 
processes are going on more or less all the time, more or less everywhere. Do we not 
have jumping then all the time? 

The first charge against "measurement", in the fundamental axioms of quantum 
mechanics, is that it anchors the shifty split of the world into "system" and "appa- 
ratus" . A second charge is that the word comes loaded with meaning from everyday 
life, meaning which is entirely inappropriate in the quantum context. When it is said 
that something is "measured" it is difficult not to think of the result as referring to 
some preexisting property of the object in question. This is to disregard Bohr's insis- 
tence that in quantum phenomena the apparatus as well as the system is essentially 
involved. If it were not so, how could we understand, for example, that "measure- 
ment" of a component of "angular momentum" . . .in an arbitrarily chosen direction 
. . . yields one of a discrete set of values? When one forgets the role of the appara- 
tus, as the word "measurement" makes all too likely, one despairs of ordinary logic 
. . . hence "quantum logic" . When one remembers the role of the apparatus, ordinary 
logic is just fine. 

In other contexts, physicists have been able to take words from ordinary language 
and use them as technical terms with no great harm done. Take for example the 
"strangeness", "charm", and "beauty" of elementary particle physics. No one is 
taken in by this "baby talk" .... Would that it were so with "measurement" . But in 
fact the word has had such a damaging effect on the discussion, that I think it should 
now be banned altogether in quantum mechanics. {Ibid.) 

While Bell focuses directly here on the misuse of the word "measurement" rather than on 
that of "observable," it is worth noting that the abuse of "measurement" is in a sense inseparable 
from that of "observable," i.e., from naive realism about operators. After all, one would not be 
very likely to speak of measurement unless one thought that something, some "observable" that 
is, was somehow there to be measured. 

Operationalism, so often used without a full appreciation of its consequences, may lead many 
physicists to beliefs which are the opposite of what one might expect. Namely, by believing 
somehow that a physical property is and must be defined by an operational definition, many 
physicists come to regard properties such as spin and polarization, which can easily be opera- 
tionally defined, as intrinsic properties of the system itself, the electron or photon, despite all the 
difficulties that this entails. If operational definitions were banished, and "real definitions" were 
required, there would be far less reason to regard these "properties" as intrinsic, since they are 
not defined in any sort of intrinsic way; in short, we have no idea what they really mean, and 
there is no reason to think they mean anything beyond the behavior exhibited by the system in 
interaction with an apparatus. 

There are two primary sources of confusion, mystery and incoherence in the foundations of 
quantum mechanics: the insistence on the completeness of the description provided by the wave 
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function, despite the dramatic difficulties entailed by this dogma, as illustrated most famously by 
the measurement problem; and naive realism about operators. While the second seems to point 
in the opposite direction from the first, the dogma of completeness is in fact nourished by naive 
realism about operators. This is because naive realism about operators tends to produce the 
belief that a more complete description is impossible because such a description should involve 
preexisting values of the quantum observables, values that are revealed by measurement. And 
this is impossible. But without naive realism about operators — without being misled by all the 
quantum talk of the measurement of observables — most of what is shown to be impossible by the 
impossibility theorems would never have been expected to begin with. 
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